Abstract: This paper studies massive MIMO full-duplex relaying (MM-FDR), where multiple source-destination pairs communicate simultaneously with the help of a common full-duplex relay equipped with very large antenna arrays. Different from the traditional MM-FDR protocol, a general model is considered in which sources and destinations may be equipped with multiple antennas. In contrast to conventional MIMO systems, massive MIMO must be built with low-cost components, which are prone to hardware impairments. In this paper, the effect of hardware impairments is taken into consideration and is modeled using transmit/receive distortion noises. We propose a low-complexity hardware impairments aware transceiver scheme (named the HIA scheme) that mitigates the distortion noises by exploiting the statistical knowledge of channels and the antenna arrays at sources and destinations. A joint degree of freedom and power optimization algorithm is presented to further optimize the spectral efficiency of HIA-based MM-FDR. The results show that the HIA scheme can mitigate the “ceiling effect” that appears in the traditional MM-FDR protocol, if the numbers of antennas at sources and destinations can scale with that at the relay.

Index Terms: Massive MIMO full-duplex relaying, hardware impairments, transceiver design, joint degree of freedom and power optimization, achievable rate.

I. Introduction 

In multi-user MIMO systems, one main challenge is the increased complexity and energy consumption of the signal processing needed to mitigate the interference between multiple co-channel users. To achieve energy-efficient transmission, the multi-user MIMO system with very large antenna arrays at each base station (known as the “massive MIMO” system) has been advocated recently. The key result is that, with very large antenna arrays at each base station, both the intracell and intercell interference can be substantially reduced with simple linear beamforming (BF) processing.

On the other hand, full-duplex relaying (FDR) is a promising approach to improve the spectral efficiency (SE) of relaying networks while retaining the merits of half-duplex relaying (HDR) (e.g., path loss reduction). In FDR, the relay transmits and receives simultaneously at the same frequency and time, but at the cost of a strong echo interference (EI) due to signal leakage between the relay output and input. To mitigate EI, three approaches have been investigated, i.e., 1) passive cancellation, 2) time-domain cancellation and 3) spatial suppression. Passive cancellation relies on a combination of path loss, cross-polarization and antenna directionality. Time-domain cancellation is based on the fact that the EI signal is known at the full-duplex node. Thus, it can be regenerated and removed in the time domain. In spatial suppression, EI is mitigated with the multiple transmit/receive antennas by approaches such as null-space projection. Inspired by these works, a number of studies have been dedicated to the FDR protocol in both theory and testbed implementations. To achieve spectrally and energy-efficient transmission for multiple source-destination pairs, recent works have sought to combine both HDR [10] and FDR [11]-[13] with massive MIMO.

However, the aforementioned works on massive MIMO are based on the assumption that the base stations or relays are equipped with a large number of high-quality transmit/receive radio frequency (RF) chains (which are expensive and power-hungry). In contrast to conventional MIMO systems (e.g., at most 8 antennas in the LTE system), massive MIMO must be built with low-cost components, since the deployment cost and energy consumption of circuits will increase dramatically as the number of antennas grows very large. Such low-cost components are prone to hardware imperfections (e.g., phase noise, nonlinear power amplifier, I/Q imbalance, nonlinear low-noise amplifier and ADC impairments), which must be considered in the design of practical massive MIMO systems.

This paper focuses on the transceiver design for massive MIMO full-duplex relaying (MM-FDR) with hardware impairments. The effect of hardware impairments is modeled using transmit/receive distortion noises. There are several challenges in the design of a practical transceiver scheme for the considered system. The first is how to perform EI cancellation without the instantaneous EI channel. EI cancellation is a critical problem in MM-FDR transceiver design: it is important not only for reducing EI power, but also for reducing the distortion noises caused by hardware impairments at the relay (as will be shown in section IV). Different from FDR with small-scale relay antenna arrays, the instantaneous EI channel is usually not easy to obtain in MM-FDR. This is because learning the EI channel requires a training sequence whose length is not less than the number of relay antennas, which is prohibitive in MM-FDR as the channel coherence time is limited. The lack of the EI channel makes traditional EI cancellation techniques (e.g., time-domain cancellation and spatial suppression) difficult to apply. Although passive cancellation does not rely on EI channel estimation, it usually cannot provide satisfactory performance when used alone. The second is how to suppress the distortion noises caused by hardware impairments at sources and destinations. Different from multi-user interference (MUI), the distortion noise caused by the transmit imperfection of a source can be viewed as an interference signal with the same channel as the transmitted data. Thus, it cannot be suppressed by the relay antenna arrays during coherent combining (a similar problem appears in reception at the destinations). This causes a performance ceiling on the achievable rate as the number of relay antennas grows large, which degrades the gain of massive MIMO significantly.

In this paper, we propose a practical transceiver scheme for MM-FDR with hardware impairments that addresses the above problems. Different from the traditional MM-FDR protocol (where sources and destinations are equipped with a single antenna), we consider a general model where sources and destinations may be equipped with multiple antennas. The contributions are summarized as follows:

• We first examine the limitation of the traditional MM-FDR protocol under hardware impairments. In particular, we derive an upper bound on the end-to-end achievable rate for the traditional MM-FDR protocol with linear processing at the relay. The bound reveals that the achievable rate is limited by the hardware impairments at the sources and destinations, and that a performance ceiling appears as the number of relay antennas grows large. The result also implies that it is impossible to cancel the “ceiling effect” with linear processing if sources and destinations are equipped with only a single antenna.

• Based on the upper bound analysis, we propose a hardware impairments aware transceiver scheme (HIA scheme) to mitigate the distortion noises by exploiting the statistical channel knowledge and the antenna arrays of the sources/destinations. The scheme needs no instantaneous knowledge of the EI channel. The asymptotic end-to-end achievable rate of MM-FDR with the HIA scheme (HIA-MM-FDR) is derived, and the scaling behaviors of MUI, EI and distortion noises are determined.

• A joint degree of freedom and power optimization (JDPO) algorithm is presented to further improve the SE of HIA-MM-FDR.

The paper is organized as follows. In section II, we review the related work. The system model is described in section III. The upper bound on the achievable rate is analyzed in section IV. The HIA scheme is proposed in section V and the JDPO algorithm is presented in section VI. Simulation results are presented in section VII, and conclusions are drawn at the end.

Notations: $\mathbb{E}(\cdot)$ and $\mathrm{var}(\cdot)$ denote the expectation and variance. $\mathbf{I}_n$ is the $n \times n$ identity matrix. $(\cdot)^*$, $(\cdot)^T$ and $(\cdot)^H$ denote conjugate, transpose and conjugate-transpose, respectively. $\rho(\mathbf{A})$, $\mathrm{Tr}(\mathbf{A})$, $[\mathbf{A}]_{i,j}$, $\lambda_l(\mathbf{A})$ and $\mathbf{u}_l(\mathbf{A})$ denote the spectral radius, trace, $(i,j)$th element, $l$th largest eigenvalue and the eigenvector corresponding to the $l$th largest eigenvalue of matrix $\mathbf{A}$, respectively. “$a$ scales with $b$” means $0 < \lim_{b \to \infty} a/b < \infty$.

II. Related Work 

The design and performance of cellular systems with an unlimited number of base station antennas were first considered for independent antennas, and were soon extended to scenarios with spatially correlated antennas. The asymptotic SINR for a single-cell cellular system has also been analyzed. It has been shown that, with very large antenna arrays at the base station, a deterministic SINR (also called the “deterministic equivalent” of the SINR), which depends only on the large-scale fading of the channels, can be achieved if the transmit power is scaled by $N$ with perfect CSI and by $\sqrt{N}$ with imperfect CSI ($N$ denotes the number of base station antennas). Considerable work has also been done to derive the “deterministic equivalent” of the SINR for massive MIMO systems with spatially correlated antennas.

In the field of massive MIMO relaying, the SE and energy efficiency (EE) of HDR with very large relay antenna arrays were investigated in [10], [19]. MM-FDR with a decode-and-forward (DF) relay was first introduced in [11], [12], where the end-to-end achievable rate was analyzed under linear processing. The asymptotic performance of amplify-and-forward based MM-FDR was considered in [13], and the scaling behavior of the infinitely repeating echo interference was determined.

Only a few works have considered the effect of hardware impairments on massive MIMO systems. A constant envelope signal design was proposed in [21] to facilitate the use of power-efficient RF power amplifiers. The authors in [22] presented a low peak-to-average-power ratio (PAR) precoding solution to enable efficient implementation using non-linear RF components in massive MIMO systems. In FDR with small-scale antenna arrays, the optimal precoding under limited ADC dynamic range has been studied. However, these works cannot provide much insight into the effect of hardware impairments on MM-FDR.

III. System Model

Consider a network with $K$ source-destination (S-D) pairs and a single full-duplex relay $R$, where source $S_k$ wishes to communicate with destination $D_k$ ($k \in \{1, \cdots, K\}$) with the help of $R$. The relay adopts the DF policy. It is assumed that the sources and destinations are equipped with $N_S$ and $N_D$ antennas respectively, while the relay is equipped with $N_R + N_T$ antennas ($N_R$ for reception and $N_T$ for transmission). We are interested in the large-$(N_R, N_T)$ regime, i.e., $\min\{N_R, N_T\} \to \infty$. $N_S$, $N_D$ and $K$ can be either fixed or scale with $\min\{N_R, N_T\}$.

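Let $\mathbf{H}_{SR,k} \in \mathbb{C}^{N_R \times N_S}$ be the channel matrix from $S_k$ to the receive antenna array of the relay and let $\mathbf{H}_{RD,k} \in \mathbb{C}^{N_T \times N_D}$ be the channel matrix from $D_k$ to the transmit array of the relay. Let $\mathbf{H}_{EI} \in \mathbb{C}^{N_R \times N_T}$ denote the EI channel matrix between the transmit and receive arrays of the relay. The spatial correlation of each MIMO channel is characterized by the Kronecker model [23]. Thus, $\mathbf{H}_{SR,k}$ can be expressed as $\mathbf{H}_{SR,k} = \sqrt{\beta_{SR,k}}\,\mathbf{C}_{SR,k}^{1/2}\mathbf{X}_{SR,k}\tilde{\mathbf{C}}_{SR,k}^{1/2}$, where $\beta_{SR,k}$ represents the large-scale fading. $\mathbf{C}_{SR,k} \in \mathbb{C}^{N_R \times N_R}$ and $\tilde{\mathbf{C}}_{SR,k} \in \mathbb{C}^{N_S \times N_S}$ characterize the spatial correlation of received signals across the receive array of the relay and that of transmitted signals across the transmit array of $S_k$. $\mathbf{X}_{SR,k} \in \mathbb{C}^{N_R \times N_S}$ consists of the random components of the channel, whose elements are i.i.d. with distribution $\mathcal{CN}(0,1)$. Based on the same model, the channel matrices $\mathbf{H}_{RD,k}$ and $\mathbf{H}_{EI}$ can be expressed as $\mathbf{H}_{RD,k} = \sqrt{\beta_{RD,k}}\,\mathbf{C}_{RD,k}^{1/2}\mathbf{X}_{RD,k}\tilde{\mathbf{C}}_{RD,k}^{1/2}$ and $\mathbf{H}_{EI} = \sqrt{\beta_{EI}}\,\mathbf{C}_{EI}^{1/2}\mathbf{X}_{EI}\tilde{\mathbf{C}}_{EI}^{1/2}$. To facilitate the analysis, we assume the following conditions on the correlation matrices, i.e., $\forall \mathbf{C} \in \{\mathbf{C}_{SR,k}, \tilde{\mathbf{C}}_{SR,k}, \mathbf{C}_{RD,k}, \tilde{\mathbf{C}}_{RD,k}, \mathbf{C}_{EI}, \tilde{\mathbf{C}}_{EI}\}$, where $k \in \{1, \cdots, K\}$,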

• A1: The spectral radius of $\mathbf{C}$ is bounded by a constant, i.e., $\rho(\mathbf{C}) < c$, where $c$ is a constant.

• A2: $\mathbf{C}$ is a Hermitian Toeplitz matrix with unit diagonal elements.

The former is a common assumption in studies of massive MIMO and follows from energy conservation, while A2 corresponds to the case of a uniform linear array¹ (ULA) [23].
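As an illustration of the Kronecker model and of assumptions A1 and A2, the following minimal sketch draws one channel realization. The exponential correlation profile $[\mathbf{C}]_{i,j} = r^{|i-j|}$, the array sizes and the helper names (`exp_toeplitz_corr`, `psd_sqrt`, `kronecker_channel`) are assumptions made for the example only; they are not taken from the paper.

```python
import numpy as np

def exp_toeplitz_corr(n, r):
    """Symmetric Toeplitz correlation matrix with unit diagonal: [C]_{i,j} = r^{|i-j|}."""
    idx = np.arange(n)
    return r ** np.abs(idx[:, None] - idx[None, :])

def psd_sqrt(c):
    """Hermitian square root of a positive semidefinite matrix via eigendecomposition."""
    w, v = np.linalg.eigh(c)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.conj().T

def kronecker_channel(beta, c_rx, c_tx, rng):
    """Draw H = sqrt(beta) * C_rx^{1/2} X C_tx^{1/2}, X having i.i.d. CN(0,1) entries."""
    n_rx, n_tx = c_rx.shape[0], c_tx.shape[0]
    x = (rng.standard_normal((n_rx, n_tx))
         + 1j * rng.standard_normal((n_rx, n_tx))) / np.sqrt(2)
    return np.sqrt(beta) * psd_sqrt(c_rx) @ x @ psd_sqrt(c_tx)

rng = np.random.default_rng(0)
C_rx = exp_toeplitz_corr(64, 0.5)   # relay receive array, N_R = 64 (assumed)
C_tx = exp_toeplitz_corr(4, 0.7)    # source array, N_S = 4 (assumed)
H = kronecker_channel(beta=1.0, c_rx=C_rx, c_tx=C_tx, rng=rng)
print(H.shape, np.linalg.eigvalsh(C_rx).max())
```

For this particular profile the largest eigenvalue stays below $(1+r)/(1-r)$ for every array size, so A1 holds as the array grows, and the unit diagonal and Toeplitz structure match A2.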

To characterize the effect of hardware imperfections, we adopt the following distortion noise model.

1) Imperfect Transmit RF Chain: We model the effect of an imperfect transmit RF chain by adding, per transmit antenna, an independent zero-mean Gaussian “transmit distortion noise”, whose power is proportional to the signal power transmitted at that antenna. The experimental results in [24], [25] have shown that the independent Gaussian distortion noise model closely captures the joint effect of imperfect components in the transmit RF chain. Let $\mathbf{x}_{S,k}[u] \in \mathbb{C}^{N_S \times 1}$ and $\mathbf{x}_R[u] \in \mathbb{C}^{N_T \times 1}$ denote the transmit vectors of source $S_k$ and the relay at time instant $u$. Based on the above model, the distorted transmit signals can be expressed as

$$\tilde{\mathbf{x}}_{S,k}[u] = \mathbf{x}_{S,k}[u] + \mathbf{t}_{S,k}[u] \quad \text{and} \quad \tilde{\mathbf{x}}_R[u] = \mathbf{x}_R[u] + \mathbf{t}_R[u] \tag{1}$$

where the distortion noises $\mathbf{t}_{S,k}[u] \sim \mathcal{CN}\left(\mathbf{0}, \nu_{S,k}\,\mathrm{diag}\left(\mathbb{E}\left[\mathbf{x}_{S,k}[u]\mathbf{x}_{S,k}^H[u]\right]\right)\right)$ and $\mathbf{t}_R[u] \sim \mathcal{CN}\left(\mathbf{0}, \nu_R\,\mathrm{diag}\left(\mathbb{E}\left[\mathbf{x}_R[u]\mathbf{x}_R^H[u]\right]\right)\right)$. Note that $\nu_{S,k} \ge 0$ ($\nu_R \ge 0$) characterizes the level of transmit imperfection. For example, $\nu_{S,k} = 0$ ($\nu_R = 0$) corresponds to the conventional assumption of perfect transmit RF chains. The quality of the transmit RF chains degrades as $\nu_{S,k}$ ($\nu_R$) increases.

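2) Imperfect Receive RF Chain: We model the effect of an imperfect receive RF chain by adding, per receive antenna, an independent zero-mean Gaussian “receive distortion noise”, whose variance is proportional to the signal power received at that antenna. More precisely, let $\mathbf{y}_R[u] \in \mathbb{C}^{N_R \times 1}$ and $\mathbf{y}_{D,k}[u] \in \mathbb{C}^{N_D \times 1}$ be the undistorted received signals of the relay and destination $D_k$ at time instant $u$. The distorted received signals can be expressed as

$$\tilde{\mathbf{y}}_R[u] = \mathbf{y}_R[u] + \mathbf{r}_R[u] \quad \text{and} \quad \tilde{\mathbf{y}}_{D,k}[u] = \mathbf{y}_{D,k}[u] + \mathbf{r}_{D,k}[u] \tag{2}$$

where the distortion noises $\mathbf{r}_R[u] \sim \mathcal{CN}\left(\mathbf{0}, \mu_R\,\mathrm{diag}\left(\mathbb{E}\left[\mathbf{y}_R[u]\mathbf{y}_R^H[u]\right]\right)\right)$ and $\mathbf{r}_{D,k}[u] \sim \mathcal{CN}\left(\mathbf{0}, \mu_{D,k}\,\mathrm{diag}\left(\mathbb{E}\left[\mathbf{y}_{D,k}[u]\mathbf{y}_{D,k}^H[u]\right]\right)\right)$. $\mu_R \ge 0$ ($\mu_{D,k} \ge 0$) characterizes the level of receive imperfection. The experimental studies in [26] have shown that the independent Gaussian noise model is a good approximation to the joint effect of imperfect components in the receive RF chain.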
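The distortion noise model can be illustrated with a short sampling sketch. It draws per-antenna Gaussian distortion whose variance is a fixed fraction of the per-antenna signal power and checks the total distortion power empirically. The numerical values and the helper name `distortion_noise` are assumptions for the example, and the same helper applies to the transmit and receive cases by passing the corresponding signal covariance and distortion level.

```python
import numpy as np

def distortion_noise(signal_cov, level, n_samples, rng):
    """Sample independent per-antenna CN(0, level * diag(signal_cov)) distortion vectors."""
    var = level * np.real(np.diag(signal_cov))      # per-antenna distortion variance
    shape = (len(var), n_samples)
    g = (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)
    return np.sqrt(var)[:, None] * g

rng = np.random.default_rng(1)
E_S, nu_S = 2.0, 0.05                  # assumed transmit power and distortion level
p = np.ones(4) / 2.0                   # unit-norm BF vector for N_S = 4 (assumed)
cov_x = E_S * np.outer(p, p.conj())    # E[x x^H] for x = sqrt(E_S) p s with E|s|^2 = 1
t = distortion_noise(cov_x, nu_S, 200_000, rng)

# Total distortion power; it should be close to nu_S * E_S because ||p|| = 1.
total_power = np.mean(np.sum(np.abs(t) ** 2, axis=0))
print(total_power)
```

Because the distortion variance tracks the signal power antenna by antenna, raising the transmit power raises the distortion by the same factor, which is why power control alone cannot remove it.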

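At time instant $u$, all sources transmit signals $\mathbf{x}_{S,k}[u]$ ($k = 1, \cdots, K$) to the relay simultaneously. Meanwhile, the relay broadcasts the decoded signals to the destinations. Based on the models in (1) and (2), the received signals at the relay and $D_k$ can be expressed as

$$\begin{aligned}
\tilde{\mathbf{y}}_R[u] &= \sum_{k=1}^{K}\mathbf{H}_{SR,k}\left(\mathbf{x}_{S,k}[u] + \mathbf{t}_{S,k}[u]\right) + \mathbf{H}_{EI}\left(\mathbf{x}_R[u] + \mathbf{t}_R[u]\right) + \mathbf{r}_R[u] + \mathbf{n}_R[u] \\
\tilde{\mathbf{y}}_{D,k}[u] &= \mathbf{H}_{RD,k}^H\left(\mathbf{x}_R[u] + \mathbf{t}_R[u]\right) + \mathbf{r}_{D,k}[u] + \mathbf{n}_{D,k}[u]
\end{aligned} \tag{3}$$

where $\mathbf{n}_R[u]$ and $\mathbf{n}_{D,k}[u]$ denote the additive noise at the relay and at $D_k$.

¹We will restrict our analysis to the scenario with a ULA at each node. The analysis for arbitrary correlation matrices will be considered as future work.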

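To keep the complexity low, it is assumed that a single data stream is transmitted at each source and each node employs only linear processing. In particular, $S_k$ transmits the unit-power symbol $s_k[u]$ using the unitary BF vector $\mathbf{p}_{S,k}$. Therefore, the transmit vector of $S_k$ can be expressed as $\mathbf{x}_{S,k}[u] = \sqrt{E_{S,k}}\,\mathbf{p}_{S,k}s_k[u]$, where $E_{S,k}$ denotes the transmit power. The relay combines the received signal by multiplying it with the receive BF matrix $\mathbf{W}_R = [\mathbf{w}_{R,1}, \cdots, \mathbf{w}_{R,K}]$, i.e., $\hat{\mathbf{y}}_R[u] = \mathbf{W}_R^H\tilde{\mathbf{y}}_R[u]$. The $k$th element of $\hat{\mathbf{y}}_R[u]$,

$$\begin{aligned}
\hat{y}_{R,k}[u] ={}& \sqrt{E_{S,k}}\,\mathbf{w}_{R,k}^H\mathbf{H}_{SR,k}\mathbf{p}_{S,k}s_k[u] + \sum_{j=1, j\neq k}^{K}\sqrt{E_{S,j}}\,\mathbf{w}_{R,k}^H\mathbf{H}_{SR,j}\mathbf{p}_{S,j}s_j[u] + \sum_{j=1}^{K}\mathbf{w}_{R,k}^H\mathbf{H}_{SR,j}\mathbf{t}_{S,j}[u] \\
&+ \mathbf{w}_{R,k}^H\mathbf{H}_{EI}\left(\mathbf{x}_R[u] + \mathbf{t}_R[u]\right) + \mathbf{w}_{R,k}^H\mathbf{r}_R[u] + \mathbf{w}_{R,k}^H\mathbf{n}_R[u]
\end{aligned} \tag{4}$$

is used to decode the symbol of $S_k$. The relay forwards the decoded symbols using the transmit BF matrix $\mathbf{W}_T = [\mathbf{w}_{T,1}, \cdots, \mathbf{w}_{T,K}]$. The transmit vector of the relay can be expressed as $\mathbf{x}_R[u] = \mathbf{W}_T\mathbf{\Lambda}_R^{1/2}\mathbf{s}[u-d]$, where $\mathbf{s}[u-d] = [s_1[u-d], \cdots, s_K[u-d]]^T$, $\mathbf{\Lambda}_R = \mathrm{diag}(E_{R,1}, \cdots, E_{R,K})$ is the power allocation matrix at the relay, and $d$ denotes the processing delay of the relay. To meet the relay's power constraint, $\mathbf{x}_R[u]$ must satisfy $\mathrm{Tr}\left(\mathbb{E}\left[\mathbf{x}_R[u]\mathbf{x}_R^H[u]\right]\right) = \sum_{i=1}^{K}E_{R,i} \le E_R^{\max}$. $D_k$ uses the unitary BF vector $\mathbf{p}_{D,k}$ to combine the received signal $\tilde{\mathbf{y}}_{D,k}[u]$. The combined signal is expressed as

$$\begin{aligned}
\hat{y}_{D,k}[u] ={}& \sqrt{E_{R,k}}\,\mathbf{p}_{D,k}^H\mathbf{H}_{RD,k}^H\mathbf{w}_{T,k}s_k[u-d] + \mathbf{p}_{D,k}^H\sum_{j=1, j\neq k}^{K}\sqrt{E_{R,j}}\,\mathbf{H}_{RD,k}^H\mathbf{w}_{T,j}s_j[u-d] \\
&+ \mathbf{p}_{D,k}^H\mathbf{H}_{RD,k}^H\mathbf{t}_R[u] + \mathbf{p}_{D,k}^H\mathbf{r}_{D,k}[u] + \mathbf{p}_{D,k}^H\mathbf{n}_{D,k}[u]
\end{aligned} \tag{5}$$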

IV. Limitation of MM-FDR with Single Antenna 
Sources and Destinations 

This section analyzes the upper bound on the achievable rate of MM-FDR with single-antenna² sources and destinations. The goal is to reveal the fundamental limitation of the previous MM-FDR protocol with single-antenna sources and destinations in combating distortion noises. Moreover, the bound provides important insight into the design of a practical transceiver scheme that mitigates the distortion noises when multiple antennas are available at sources and destinations (see section V).


A. Upper Bound on Achievable Rate 

As $N_S = N_D = 1$, we set $p_{S,k} = p_{D,k} = 1$, replace $\mathbf{H}_{i,k}$ ($i \in \{SR, RD\}$) with $\mathbf{h}_{i,k}$ and let $\mathbf{H}_i = [\mathbf{h}_{i,1}, \cdots, \mathbf{h}_{i,K}]$ and $\mathbf{t}_S[u] = [t_{S,1}[u], \cdots, t_{S,K}[u]]^T$. We consider a block-fading channel with coherence time $T$ (symbol times), where $\tau$ symbols are used for uplink training of each source/destination, and the remaining $T - 2K\tau$ are used for data transmission. The upper bound on the achievable rate is obtained by assuming that perfect knowledge of $\mathbf{h}_{SR,k}$, $\mathbf{h}_{RD,k}$ and $\mathbf{H}_{EI}$ can be provided with pilot signals and that, meanwhile, the MUIs (i.e., the second terms on the right-hand side of (4) and (5)) and the EI term $\mathbf{w}_{R,k}^H\mathbf{H}_{EI}\mathbf{x}_R[u]$ can be somehow canceled³. This gives us the following upper bound on the end-to-end achievable rate

$$R_k \le \max_{\|\mathbf{w}_{R,k}\| = \|\mathbf{w}_{T,k}\| = 1} \min\left\{R_{SR,k}, R_{RD,k}\right\} \tag{6}$$

where $R_{SR,k}$ and $R_{RD,k}$ denote the achievable rates of the $S_k \to R$ and $R \to D_k$ channels, respectively. According to (4), (5) and the assumptions above, we have


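$$\begin{aligned}
R_{SR,k} &= \frac{T - 2K\tau}{T}\log_2\left(1 + \frac{E_{S,k}\left|\mathbf{w}_{R,k}^H\mathbf{h}_{SR,k}\right|^2}{\mathbf{w}_{R,k}^H\mathbf{Q}\mathbf{w}_{R,k}}\right) \\
R_{RD,k} &= \frac{T - 2K\tau}{T}\log_2\left(1 + \frac{E_{R,k}\mathbf{w}_{T,k}^H\mathbf{h}_{RD,k}\mathbf{h}_{RD,k}^H\mathbf{w}_{T,k}}{\mathbf{h}_{RD,k}^H\mathbf{\Theta}_R^t\mathbf{h}_{RD,k} + \mathbb{E}\left[|r_{D,k}[u]|^2\right] + 1}\right)
\end{aligned} \tag{7}$$

where $\mathbf{Q}$ is given as $\mathbf{Q} = \mathbf{H}_{SR}\mathbf{\Theta}_S^t\mathbf{H}_{SR}^H + \mathbf{H}_{EI}\mathbf{\Theta}_R^t\mathbf{H}_{EI}^H + \mathbf{\Theta}_R^r + \mathbf{I}_{N_R}$, with $\mathbf{\Theta}_S^t = \mathbb{E}\left[\mathbf{t}_S[u]\mathbf{t}_S^H[u]\right]$, $\mathbf{\Theta}_R^t = \mathbb{E}\left[\mathbf{t}_R[u]\mathbf{t}_R^H[u]\right]$ and $\mathbf{\Theta}_R^r = \mathbb{E}\left[\mathbf{r}_R[u]\mathbf{r}_R^H[u]\right]$ denoting the covariance matrices of the distortion noises $\mathbf{t}_S[u]$, $\mathbf{t}_R[u]$ and $\mathbf{r}_R[u]$, respectively.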

1) Achievable Rate of $S_k \to R$ Channel: Since $\mathbf{w}_{R,k}$ is only related to $R_{SR,k}$, it can be optimized separately to maximize $R_{SR,k}$. From (7), the optimization problem is equivalent to a generalized Rayleigh quotient problem, whose solution is $\mathbf{w}_{R,k} = \mathbf{Q}^{-1}\mathbf{h}_{SR,k} / \|\mathbf{Q}^{-1}\mathbf{h}_{SR,k}\|$. The resultant upper bound on $R_{SR,k}$ (denoted by $R_{SR,k}^{*}$) can be expressed as

$$\begin{aligned}
R_{SR,k}^{*} &= \frac{T - 2K\tau}{T}\log_2\left(1 + E_{S,k}\mathbf{h}_{SR,k}^H\mathbf{Q}^{-1}\mathbf{h}_{SR,k}\right) \\
&= \frac{T - 2K\tau}{T}\log_2\left(1 + \frac{E_{S,k}\mathbf{h}_{SR,k}^H\mathbf{Q}_k^{-1}\mathbf{h}_{SR,k}}{1 + \nu_{S,k}E_{S,k}\mathbf{h}_{SR,k}^H\mathbf{Q}_k^{-1}\mathbf{h}_{SR,k}}\right)
\end{aligned} \tag{8}$$

where $\mathbf{Q}_k$ is defined as $\mathbf{Q}_k = \mathbf{Q} - \nu_{S,k}E_{S,k}\mathbf{h}_{SR,k}\mathbf{h}_{SR,k}^H$ and is independent of $\mathbf{h}_{SR,k}$. The second step follows from the matrix inversion lemma.

²The analysis for the general case with multiple antennas at sources/destinations is more informative. However, the derivation is challenging since the BF vectors at sources, destinations and relay must be designed jointly to optimize the achievable rate, and a closed-form solution is not available. FDR with a general number of antennas at each node was studied in [31]-[33]. However, those analyses assume perfect hardware, and thus give no insight into the effect of hardware impairments. Moreover, the methods in [31]-[33] cannot be applied directly in the considered system since the transmissions from sources to relay and from relay to destinations are coupled due to the presence of distortion noises.

³With $\mathbf{H}_{EI}$, the EI signal $\mathbf{w}_{R,k}^H\mathbf{H}_{EI}\mathbf{x}_R[u]$ can be perfectly canceled since $\mathbf{x}_R[u]$ is known at the relay. For the MUIs, this can be realized by some sophisticated interference cancellation methods, e.g., minimum mean square error with successive interference cancellation (MMSE-SIC).
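The two steps behind (8) can be verified numerically. The sketch below uses a randomly generated Hermitian positive definite matrix as a stand-in for $\mathbf{Q}_k$ (an assumption; the paper's $\mathbf{Q}_k$ has a specific structure) and checks that the normalized vector $\mathbf{Q}^{-1}\mathbf{h}$ attains the Rayleigh quotient maximum $E_S\mathbf{h}^H\mathbf{Q}^{-1}\mathbf{h}$, and that the matrix inversion lemma turns this value into $E_S x / (1 + \nu_S E_S x)$ with $x = \mathbf{h}^H\mathbf{Q}_k^{-1}\mathbf{h}$.

```python
import numpy as np

rng = np.random.default_rng(3)
N_R, E_S, nu_S = 16, 1.5, 0.1                    # assumed values
h = (rng.standard_normal(N_R) + 1j * rng.standard_normal(N_R)) / np.sqrt(2)
A = rng.standard_normal((N_R, N_R)) + 1j * rng.standard_normal((N_R, N_R))
Q_k = A @ A.conj().T / N_R + np.eye(N_R)         # stand-in Hermitian PD matrix
Q = Q_k + nu_S * E_S * np.outer(h, h.conj())     # Q = Q_k + nu_S E_S h h^H

def sinr(w):
    """Rayleigh quotient E_S |w^H h|^2 / (w^H Q w)."""
    return E_S * abs(np.vdot(w, h)) ** 2 / np.real(np.vdot(w, Q @ w))

w_opt = np.linalg.solve(Q, h)
w_opt /= np.linalg.norm(w_opt)                   # w = Q^{-1} h / ||Q^{-1} h||

x = np.real(np.vdot(h, np.linalg.solve(Q_k, h)))             # h^H Q_k^{-1} h
sinr_direct = E_S * np.real(np.vdot(h, np.linalg.solve(Q, h)))
sinr_lemma = E_S * x / (1 + nu_S * E_S * x)
print(sinr(w_opt), sinr_direct, sinr_lemma)
```

The last expression makes the ceiling visible: however large $x$ becomes, the ratio cannot exceed $1/\nu_S$.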

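Theorem 1: Assume that assumptions A1 and A2 hold. As $N_S = N_D = 1$, the achievable rate of the $S_k \to R$ channel in the large-$(N_R, N_T)$ regime is bounded as

$$R_{SR,k} \le R_{SR,k}^{Upper} = \frac{T - 2K\tau}{T}\log_2\left(1 + \frac{E_{S,k}\beta_{SR,k}\mathrm{Tr}\left(\mathbf{C}_{SR,k}\mathbf{\Psi}_k\right)}{1 + \nu_{S,k}E_{S,k}\beta_{SR,k}\mathrm{Tr}\left(\mathbf{C}_{SR,k}\mathbf{\Psi}_k\right)}\right) \tag{9}$$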

where p = /J^/Tr -I- 1, ’4'fc (p) is determined by 

the following fixed-point algorithm with (p) = - 


(rt = (s, + eS + 

K /_ N 

= TrJ2 Rs,iPs,iI3srAnr + /Sfi/Tr I Cei ®]. + TrIne 

1=1 ^ ' 

Sk,i (p) = lim (5l 


n—>-oo ’ 

x(") _ t'S,lPS,lPsR,l 

^k,l — Nr 


xTr CsR,i 


K 

7^ S 


fS,iPS,jPsR,j^SR,3 




1 + 47'’ 


(P) 



Proof: See Appendix-B. ■

The convergence of the fixed-point algorithm in Theorem 1 has been proved in prior work.

2) Achievable Rate of $R \to D_k$ Channel: From (7), the achievable rates of the $S_k \to R$ and $R \to D_k$ channels are coupled through $\mathbf{\Theta}_R^t = \mathbb{E}\left[\mathbf{t}_R[u]\mathbf{t}_R^H[u]\right] = \nu_R\,\mathrm{diag}\left(\mathbb{E}\left[\mathbf{W}_T\mathbf{\Lambda}_R\mathbf{W}_T^H\right]\right)$, which makes the design of $\mathbf{w}_{T,k}$ very challenging. However, with the following theorem we show that this coupling disappears in the large-$(N_R, N_T)$ regime, which allows a simple upper bound for the achievable rate of the $R \to D_k$ channel.

Theorem 2: Assume that assumptions A1 and A2 hold. As $N_S = N_D = 1$, the achievable rate of the $S_k \to R$ channel is independent of $\mathbf{w}_{T,k}$. The achievable rate of the $R \to D_k$ channel in the large-$(N_R, N_T)$ regime is bounded as

$$R_{RD,k} \le R_{RD,k}^{Upper} = \frac{T - 2K\tau}{T}\log_2\left(1 + \frac{N_T\beta_{RD,k}E_{R,k}}{\nu_R\beta_{RD,k}\sum_{i=1}^{K}E_{R,i} + \mu_{D,k}\left(N_T\beta_{RD,k}E_{R,k} + \nu_R\beta_{RD,k}\sum_{i=1}^{K}E_{R,i} + 1\right) + 1}\right) \tag{11}$$

which is achieved by the eigen BF $\mathbf{w}_{T,k} = \mathbf{h}_{RD,k} / \|\mathbf{h}_{RD,k}\|$.

Proof: See Appendix-C. ■

With Theorems 1 and 2, the upper bound on the end-to-end achievable rate in the large-$(N_R, N_T)$ regime can be expressed as

$$R_k^{Upper} = \min\left\{R_{SR,k}^{Upper}, R_{RD,k}^{Upper}\right\} \tag{12}$$

Remark 1: The bound in (12) is derived by assuming linear processing and fixed transmit powers at the sources and relay, which is in general not capacity achieving. One exception is when $K \ll \min\{N_R, N_T\}$ and $\min\{N_R, N_T\} \to \infty$. From Theorem 2, the design of the receive and transmit BF matrices is decoupled as $\min\{N_R, N_T\} \to \infty$. Moreover, the distortion noises are assumed to be circularly symmetric complex Gaussian distributed and independent of the desired signal. Thus, based on the results in [29], [30], linear processing along with power control is sufficient to achieve the capacity. When $K \ll \min\{N_R, N_T\}$, the terms $\frac{1}{N_R}\sum_{j=1, j\neq k}^{K}\frac{\nu_{S,j}E_{S,j}\beta_{SR,j}\mathbf{C}_{SR,j}}{1 + \delta_{k,j}^{(n-1)}(\rho)}$ in (10) vanish. By neglecting the low order term (in $\min\{N_R, N_T\}$), the bound in (12) reduces to $\frac{T - 2K\tau}{T}\log_2\left(1 + \min\left\{\frac{1}{\nu_{S,k}}, \frac{1}{\mu_{D,k}}\right\}\right)$, which is independent of $E_{S,k}$ and $E_{R,k}$. Thus, (12) is the upper bound for the capacity of the $S_k \to R \to D_k$ channel as $K \ll \min\{N_R, N_T\}$ and $\min\{N_R, N_T\} \to \infty$.
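The saturation of the $R \to D_k$ bound can be made concrete with a few numbers. The sketch below assumes that the SINR in (11) has numerator $N_T\beta E_{R,k}$ and denominator $\nu_R\beta\sum_i E_{R,i} + \mu_{D,k}(N_T\beta E_{R,k} + \nu_R\beta\sum_i E_{R,i} + 1) + 1$; the parameter values and the function name `rate_rd_upper` are illustrative assumptions.

```python
import math

def rate_rd_upper(n_t, e_r, k, beta, nu_r, mu_d, t_coh, tau):
    """Evaluate the R -> D_k bound for destination k (0-based), in bits per channel use."""
    signal = n_t * beta * e_r[k]                     # desired power after eigen BF
    tx_dist = nu_r * beta * sum(e_r)                 # relay transmit distortion
    denom = tx_dist + mu_d * (signal + tx_dist + 1.0) + 1.0
    prelog = (t_coh - 2 * len(e_r) * tau) / t_coh    # training overhead (T - 2K tau) / T
    return prelog * math.log2(1.0 + signal / denom)

e_r = [1.0, 1.0, 1.0, 1.0]                           # K = 4 equal relay powers (assumed)
params = dict(e_r=e_r, k=0, beta=1.0, nu_r=0.01, mu_d=0.01, t_coh=200, tau=1)
antennas = (16, 64, 256, 1024, 10**8)
rates = [rate_rd_upper(n_t, **params) for n_t in antennas]

# Ceiling set by the receive distortion level of the destination alone.
ceiling = (200 - 2 * 4 * 1) / 200 * math.log2(1.0 + 1.0 / 0.01)
for n_t, r in zip(antennas, rates):
    print(n_t, round(r, 4))
print("ceiling", round(ceiling, 4))
```

Under these assumptions the rate grows with $N_T$ but never crosses the value fixed by $1/\mu_{D,k}$, which is the ceiling effect discussed in this section.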


B. Limitation with Hardware Impairments 

Different from Remark 1, we consider the general case in which $K$ can be either fixed or scale with $\min\{N_R, N_T\}$. With Theorem 1 and Theorem 2, the effect of hardware impairments is still hard to analyze since $\mathbf{\Psi}_k$ is not in closed form. To make the analysis tractable, we assume $\mathbf{C}_{SR,l} = \mathbf{C}_{SR}$, $\forall l \in \{1, \cdots, K\}$. Applying the eigenvalue decomposition of $\mathbf{C}_{SR}$ to $\delta_{k,l}^{(n)}(\rho)$ in (10), we have


47 ip) = 


>^S,lPS,lPsR,l 


N. 


R 


Tr {Tjsr 


K 




^S,jPS,jf3sR,j 


j = 1 + ^k,l 


(n-1) 


ip) 


EiSR + ®R + pi 


I-Nr 


(13) 

By examining (13) and the expressions in (10), $\delta_{k,l}^{(n)}(\rho)$ is an $O(1)$ term. Note that this is valid for arbitrary $n > 0$. Thus, we can conclude that $\delta_{k,l}(\rho) = O(1)$. Replacing $\delta_{k,l}(\rho)$ in $\mathbf{\Psi}_k$ with the symbol $O(1)$ and substituting the result into (9), $R_{SR,k}^{Upper}$ can be written as (14).

For convenience, we further assume $\{\beta_{SR,l}, \beta_{RD,l}, E_{S,l}, E_{R,l}, \nu_{S,l}, \mu_{D,l}\} = \{\beta_{SR}, \beta_{RD}, E_S, E_R, \nu_S, \mu_D\}$. Note that this assumption does not affect the basic conclusions of the analysis. By neglecting the low order terms (in $\min\{N_R, N_T\}$) in (14) and (11), the upper bound on the end-to-end achievable rate reduces to

pUpper _ 


T-2Kt 

T 


X log 2 1 -b min ■ 


_^_ 1 ) 

PD.k + ^k/R (1 -b PD,k) j J 


One can make several observations from (15):

1) As $K \ll \min\{N_R, N_T\}$, the effect of hardware impairments at the relay and of EI disappears in the large-$(N_R, N_T)$ regime, and the upper bound converges to that in Remark 1. The end-to-end achievable rate is limited by the distortion noises caused by hardware impairments at the sources and destinations. The explanation is that the transmit distortion can be viewed as an interference signal that has the same channel as the data of $S_k$, as seen from (4). Thus, it cannot be suppressed after combining by $\mathbf{w}_{R,k}$. The power of the distortion due to the receive imperfection of $D_k$ is proportional to that of the desired received signal (after eigen BF) and also cannot be reduced by the transmit BF scheme of the relay. Thus, there is a finite ceiling on the end-to-end achievable rate as $\min\{N_R, N_T\} \to \infty$. In fact, this poses a major limitation on MM-FDR with single-antenna sources and destinations, which greatly degrades the gain of the massive array at the relay.

2) As $K$ scales with $\min\{N_R, N_T\}$, the effect of hardware impairments at the relay and of EI is not negligible. Since the EI channel is typically stronger than the desired channels, the $S_k \to R$ channel becomes the bottleneck of the end-to-end achievable rate. This indicates that it is of great importance to suppress EI to avoid the bottleneck effect.

V. Hardware Impairments Aware Transceiver 

In this section, we consider a general model where sources and destinations are equipped with an arbitrary number of antennas. We first present a low-complexity hardware impairments aware transceiver scheme (referred to as the HIA scheme). Then the achievable rate of MM-FDR with the proposed HIA scheme (HIA-MM-FDR) is analyzed in the large-$(N_R, N_T)$ regime and the effect of hardware impairments is discussed.

A. Transceiver Scheme Description 

The optimal transceiver scheme that maximizes the achievable rate of MM-FDR is hard to find since the transmissions of the $S_k \to R$ and $R \to D_k$ channels are deeply coupled due to the presence of EI and distortion noises. Even without EI and distortion noises, the problem has been proved NP-hard. As $\mathbf{H}_{EI}$ is an $N_R \times N_T$ matrix, learning $\mathbf{H}_{EI}$ requires a training sequence with length no less than $\min\{N_R, N_T\}$. This task is prohibitive as the channel coherence time is limited. Thus, we assume that passive EI cancellation has been used and that the instantaneous EI channel is not available. A low-complexity HIA scheme is proposed to mitigate EI (after passive cancellation) and distortion noises. As will be shown in simulations, the HIA scheme along with passive EI cancellation provides satisfactory gain compared to HDR.

1) Transceiver Design at the Relay: As the EI channel is typically stronger than the desired channels, it is of great importance to control the EI power. From section IV-B, EI suppression is also important in reducing the distortion noise due to hardware impairments at the relay. We consider a two-stage BF scheme at the relay. The receive and transmit BF matrices are expressed as $\mathbf{W}_R = \mathbf{P}_R\bar{\mathbf{W}}_R$ and $\mathbf{W}_T = \mathbf{P}_T\bar{\mathbf{W}}_T$.

Outer BF matrices: The outer BF matrices $\mathbf{P}_R \in \mathbb{C}^{N_R \times A_R}$ ($K \le A_R \le N_R$) and $\mathbf{P}_T \in \mathbb{C}^{N_T \times A_T}$ ($K \le A_T \le N_T$) are designed to suppress EI. $A_R$ and $A_T$ are design parameters. Due to the lack of $\mathbf{H}_{EI}$, $\mathbf{P}_R$ and $\mathbf{P}_T$ are allowed to depend
^log2


'SR,k 


T 


1 + 


1 = 1 \ 1=1,l^k J 

1 I ,, „ p ^ /r^ \ ( ^i(Csr) vs,iEs,iPsr,i , '^''(®h) , „ 

1 + vs,kPs,kPSR,k E A; K^sr) — E — 1 + 0 ( 1 ) -^- nP -^ P 


1=1,l^k 


(14) 


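only on the statistical knowledge of the EI channel. With (4), we design an average EI power minimization problem, i.e.,

$$(\mathbf{P}_R, \mathbf{P}_T) = \arg\min_{\mathbf{P}_R, \mathbf{P}_T} \mathbb{E}\left[\bar{\mathbf{w}}_{R,k}^H\mathbf{P}_R^H\mathbf{H}_{EI}\left(\mathbf{P}_T\bar{\mathbf{W}}_T\mathbf{\Lambda}_R\bar{\mathbf{W}}_T^H\mathbf{P}_T^H + \mathbf{\Theta}_R^t\right)\mathbf{H}_{EI}^H\mathbf{P}_R\bar{\mathbf{w}}_{R,k}\right] \tag{16}$$

In (16), the design of $\mathbf{P}_R$ and $\mathbf{P}_T$ is coupled with the inner BF matrices, which makes a closed-form solution inaccessible. Thus, it is of great interest to find a new target function that decouples the problem. To this end, using the Cauchy-Schwarz inequality twice, an upper bound on the average EI power can be obtained as

$$\mathrm{EI} \le \mathrm{EI}^{Upper} = \|\bar{\mathbf{w}}_{R,k}\|^2\,\mathrm{Tr}\left(\bar{\mathbf{W}}_T\mathbf{\Lambda}_R\bar{\mathbf{W}}_T^H\right)\mathrm{Tr}\left(\mathbf{P}_R^H\mathbf{H}_{EI}\mathbf{P}_T\mathbf{P}_T^H\mathbf{H}_{EI}^H\mathbf{P}_R\right) + \nu_R\sum_{i=1}^{K}E_{R,i}\,\|\bar{\mathbf{w}}_{R,k}\|^2\,\mathrm{Tr}\left(\mathbf{P}_R^H\mathbf{H}_{EI}\mathbf{H}_{EI}^H\mathbf{P}_R\right) \tag{17}$$

Then the problem (16) can be rewritten approximately as

$$\begin{aligned}
(\mathbf{P}_R, \mathbf{P}_T) &= \arg\min_{\mathbf{P}_R, \mathbf{P}_T}\left\{\mathrm{Tr}\left(\bar{\mathbf{W}}_T\mathbf{\Lambda}_R\bar{\mathbf{W}}_T^H\right)\mathbb{E}\left[\mathrm{Tr}\left(\mathbf{P}_R^H\mathbf{H}_{EI}\mathbf{P}_T\mathbf{P}_T^H\mathbf{H}_{EI}^H\mathbf{P}_R\right)\right] + \nu_R\sum_{i=1}^{K}E_{R,i}\,\mathbb{E}\left[\mathrm{Tr}\left(\mathbf{P}_R^H\mathbf{H}_{EI}\mathbf{H}_{EI}^H\mathbf{P}_R\right)\right]\right\} \\
&= \arg\min_{\mathbf{P}_R, \mathbf{P}_T}\left\{\mathrm{Tr}\left(\bar{\mathbf{W}}_T\mathbf{\Lambda}_R\bar{\mathbf{W}}_T^H\right)\mathrm{Tr}\left(\mathbf{P}_T^H\tilde{\mathbf{C}}_{EI}\mathbf{P}_T\right)\mathrm{Tr}\left(\mathbf{P}_R^H\mathbf{C}_{EI}\mathbf{P}_R\right) + \nu_R N_T\sum_{i=1}^{K}E_{R,i}\,\mathrm{Tr}\left(\mathbf{P}_R^H\mathbf{C}_{EI}\mathbf{P}_R\right)\right\}
\end{aligned} \tag{18}$$

The multiplicative term $\|\bar{\mathbf{w}}_{R,k}\|^2$ has been removed since it has no effect on the optimal solution. The second step follows from a derivation similar to that of (55). From (18), the columns of $\mathbf{P}_R$ ($\mathbf{P}_T$) are composed of the eigenvectors corresponding to the $A_R$ ($A_T$) smallest eigenvalues of $\mathbf{C}_{EI}$ ($\tilde{\mathbf{C}}_{EI}$), i.e.,

$$\begin{aligned}
\mathbf{P}_R &= \left[\mathbf{u}_{N_R - A_R + 1}(\mathbf{C}_{EI}), \mathbf{u}_{N_R - A_R + 2}(\mathbf{C}_{EI}), \cdots, \mathbf{u}_{N_R}(\mathbf{C}_{EI})\right] \\
\mathbf{P}_T &= \left[\mathbf{u}_{N_T - A_T + 1}(\tilde{\mathbf{C}}_{EI}), \mathbf{u}_{N_T - A_T + 2}(\tilde{\mathbf{C}}_{EI}), \cdots, \mathbf{u}_{N_T}(\tilde{\mathbf{C}}_{EI})\right]
\end{aligned} \tag{19}$$

In (19), $A_R$ and $A_T$ can be viewed as parameters that balance the allowable EI power and the available degrees of freedom (DOF) for data transmission, and they should be optimized with respect to a specific metric. This will be considered in section VI.

Inner BF Matrix: The inner BF matrices $\bar{\mathbf{W}}_R \in \mathbb{C}^{A_R \times K}$ and $\bar{\mathbf{W}}_T \in \mathbb{C}^{A_T \times K}$ are designed to realize the multi-user communication. $\bar{\mathbf{W}}_R$ and $\bar{\mathbf{W}}_T$ can be designed with different criteria, e.g., maximizing the desired signal power, which corresponds to eigen BF, or minimizing the MUI, which corresponds to the zero-forcing (ZF) scheme. We adopt the latter since the ZF scheme is known to approach the asymptotic limit (in $\min\{N_R, N_T\}$) of the achievable rate faster as the number of relay antennas increases. For given BF vectors at the sources and destinations, define the effective channel vectors of the $S_k \to R$ and $R \to D_k$ channels as $\bar{\mathbf{h}}_{SR,k} = \mathbf{P}_R^H\mathbf{H}_{SR,k}\mathbf{p}_{S,k}$ and $\bar{\mathbf{h}}_{RD,k} = \mathbf{P}_T^H\mathbf{H}_{RD,k}\mathbf{p}_{D,k}$, and let $\bar{\mathbf{H}}_{SR} = [\bar{\mathbf{h}}_{SR,1}, \cdots, \bar{\mathbf{h}}_{SR,K}]$ and $\bar{\mathbf{H}}_{RD} = [\bar{\mathbf{h}}_{RD,1}, \cdots, \bar{\mathbf{h}}_{RD,K}]$. The inner BF matrices can be written as

$$\bar{\mathbf{W}}_R = \bar{\mathbf{H}}_{SR}\left(\bar{\mathbf{H}}_{SR}^H\bar{\mathbf{H}}_{SR}\right)^{-1}, \qquad \bar{\mathbf{W}}_T = \bar{\mathbf{H}}_{RD}\left(\bar{\mathbf{H}}_{RD}^H\bar{\mathbf{H}}_{RD}\right)^{-1}\mathbf{\Gamma}^{-1/2} \tag{20}$$

where $\mathbf{\Gamma}$ is a diagonal normalization matrix with $[\mathbf{\Gamma}]_{i,i} = \left[\left(\bar{\mathbf{H}}_{RD}^H\bar{\mathbf{H}}_{RD}\right)^{-1}\right]_{i,i}$. In practice, $\bar{\mathbf{h}}_{SR,k}$ and $\bar{\mathbf{h}}_{RD,k}$ should be estimated in order to compute (20). This will be considered in section V-B. Note that (20) requires $K < \min\{N_R, N_T\}$ so that $A_R$ and $A_T$ can be selected to ensure the invertibility of $\bar{\mathbf{H}}_{SR}^H\bar{\mathbf{H}}_{SR}$ and $\bar{\mathbf{H}}_{RD}^H\bar{\mathbf{H}}_{RD}$.

2) Transceiver Design at the Sources and Destinations: Similar to previous works, we assume no instantaneous knowledge of channels at the sources and destinations. This is reasonable since the amount of feedback could be very large and unaffordable in MM-FDR. However, it is assumed that the local statistical knowledge of channels (i.e., $\{\mathbf{C}_{SR,k}, \tilde{\mathbf{C}}_{SR,k}\}$ for $S_k$ and $\{\mathbf{C}_{RD,k}, \tilde{\mathbf{C}}_{RD,k}\}$ for $D_k$) can be obtained. As observed from (15), when $N_S = N_D = 1$, the achievable rates of the $S_k \to R$ channel and the $R \to D_k$ channel are limited by the transmit distortion noise at $S_k$ and the receive distortion noise at $D_k$. This motivates us to design $\mathbf{p}_{S,k}$ and $\mathbf{p}_{D,k}$ to suppress these negative factors with the antenna arrays of $S_k$ and $D_k$.

Design of $\mathbf{p}_{S,k}$: Intuitively, according to (4), we can design the following problem

$$\begin{aligned}
\mathbf{p}_{S,k} &= \arg\max_{\|\mathbf{p}_{S,k}\| = 1}\frac{E_{S,k}\,\mathbb{E}\left[\mathrm{Tr}\left(\mathbf{H}_{SR,k}\mathbf{p}_{S,k}\mathbf{p}_{S,k}^H\mathbf{H}_{SR,k}^H\right)\right]}{\mathbb{E}\left[\mathrm{Tr}\left(\mathbf{H}_{SR,k}\mathbf{\Theta}_{S,k}^t\mathbf{H}_{SR,k}^H\right)\right]} \\
&= \arg\max_{\|\mathbf{p}_{S,k}\| = 1}\frac{\mathbf{p}_{S,k}^H\tilde{\mathbf{C}}_{SR,k}\mathbf{p}_{S,k}}{\nu_{S,k}\,\mathrm{Tr}\left[\tilde{\mathbf{C}}_{SR,k}\,\mathrm{diag}\left(\mathbf{p}_{S,k}\mathbf{p}_{S,k}^H\right)\right]} \\
&= \arg\max_{\|\mathbf{p}_{S,k}\| = 1}\frac{1}{\nu_{S,k}}\mathbf{p}_{S,k}^H\tilde{\mathbf{C}}_{SR,k}\mathbf{p}_{S,k}
\end{aligned} \tag{21}$$

where $\mathbf{\Theta}_{S,k}^t$ is the covariance matrix of $\mathbf{t}_{S,k}$, i.e., $\mathbf{\Theta}_{S,k}^t = \mathbb{E}\left[\mathbf{t}_{S,k}[u]\mathbf{t}_{S,k}^H[u]\right] = \nu_{S,k}E_{S,k}\,\mathrm{diag}\left(\mathbf{p}_{S,k}\mathbf{p}_{S,k}^H\right)$. The second step follows from a derivation similar to that in (18), and the last step is based on assumption A2. From (4), the right-hand side of (21) can be interpreted as the average signal to transmit distortion noise ratio at $S_k$. The solution of problem (21) is

$$\mathbf{p}_{S,k} = \mathbf{u}_1\left(\tilde{\mathbf{C}}_{SR,k}\right) \tag{22}$$

Design of $\mathbf{p}_{D,k}$: Similarly, based on (5), the receive BF vector at $D_k$ can be derived by solving the problem

$$\mathbf{p}_{D,k} = \arg\max_{\|\mathbf{p}_{D,k}\| = 1}\frac{E_{R,k}\,\mathbb{E}\left[\mathrm{Tr}\left(\mathbf{H}_{RD,k}\mathbf{p}_{D,k}\mathbf{p}_{D,k}^H\mathbf{H}_{RD,k}^H\right)\right]}{\mu_{D,k}\,\mathbf{p}_{D,k}^H\,\mathrm{diag}\left(\mathbb{E}\left[\mathbf{H}_{RD,k}^H\mathbf{H}_{RD,k}\right]\right)\mathbf{p}_{D,k}}$$

which results in

$$\mathbf{p}_{D,k} = \mathbf{u}_1\left(\tilde{\mathbf{C}}_{RD,k}\right) \tag{23}$$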

B. Reduced Dimension Channel Estimation 

During a training phase, the sources and destinations transmit their pilot sequences one after another, which allows the relay to compute the channel estimates. With the HIA scheme, it is sufficient to estimate the effective channel vectors $\mathbf{h}_{SR,k}$ and $\mathbf{h}_{RD,k}$. This allows a reduced dimension estimation scheme$^4$. For brevity, we consider the estimation of $\mathbf{h}_{SR,k}$. Let $\boldsymbol{\phi} \in \mathbb{C}^{\tau \times 1}$ and $E_T$ denote the pilot sequence and the power of each pilot symbol. $\boldsymbol{\phi}$ is multiplied by $\mathbf{p}_{S,k}$ and transmitted by $S_k$. The received pilot matrix at the relay is expressed as


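$$\mathbf{Y}_{SR,k} = \mathbf{H}_{SR,k}\left(\sqrt{E_T}\,\mathbf{p}_{S,k}\boldsymbol{\phi}^T + \mathbf{T}_{SR,k}\right) + \mathbf{R}_{SR,k} + \mathbf{N}_{SR,k} \qquad (24)$$

where $\mathbf{T}_{SR,k} \in \mathbb{C}^{N_S \times \tau}$ denotes the transmit distortion noise of $S_k$, whose $l$th column has distribution (based on the transmit distortion model) $\mathcal{CN}(\mathbf{0}, \upsilon_{S,k}E_T\,\mathrm{diag}(\mathbf{p}_{S,k}\mathbf{p}_{S,k}^H))$. $\mathbf{R}_{SR,k} \in \mathbb{C}^{N_R \times \tau}$ denotes the receive distortion noise at the relay. With the receive distortion model, the $l$th column of $\mathbf{R}_{SR,k}$ has distribution $\mathcal{CN}(\mathbf{0}, \mu_R\,\mathrm{diag}(\mathrm{E}[\mathbf{Z}_{SR,k}\mathbf{e}_l(\mathbf{Z}_{SR,k}\mathbf{e}_l)^H]))$. $\mathbf{N}_{SR,k}$ is the AWGN matrix, whose elements are i.i.d. and distributed as $\mathcal{CN}(0, 1)$. By multiplying both sides of (24) with $\mathbf{P}_R^H$, we have

$$\mathbf{P}_R^H\mathbf{Y}_{SR,k} = \sqrt{E_T}\,\mathbf{h}_{SR,k}\boldsymbol{\phi}^T + \mathbf{P}_R^H\mathbf{H}_{SR,k}\mathbf{T}_{SR,k} + \mathbf{P}_R^H\mathbf{R}_{SR,k} + \mathbf{P}_R^H\mathbf{N}_{SR,k} \qquad (25)$$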


From (25), as a merit of the HIA scheme, the total length of the pilot sequences needed to estimate all effective channels can be as small as $2K$ (when $\tau$ is set to 1).
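To make the training overhead concrete, the sketch below computes the fraction of a coherence block left for data, using the pre-log factor $(T-2K\tau)/T$ that appears in the rate expressions of this section. The comparison with direct estimation assumes a pilot length of exactly $K(N_S+N_D)$ symbols, which is the lower bound stated in the footnote; the function names are hypothetical.

```python
def prelog_hia(T, K, tau):
    """Data fraction of a coherence block when 2*K*tau symbols carry pilots."""
    return max(0.0, (T - 2 * K * tau) / T)

def prelog_direct(T, K, Ns, Nd):
    """Data fraction if the full channel matrices were estimated directly.

    Assumption: the pilot length equals its lower bound K*(Ns + Nd).
    """
    return max(0.0, (T - K * (Ns + Nd)) / T)

# Values taken from the simulation setup (T = 300, K = 10, tau = 2);
# Ns = Nd = 10 is an illustrative choice.
hia_fraction = prelog_hia(300, 10, 2)
direct_fraction = prelog_direct(300, 10, 10, 10)
```

With these numbers the reduced dimension scheme keeps 260 of 300 symbols for data, whereas direct estimation under the stated assumption would keep only 100.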

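Theorem 3: The LMMSE estimator of the effective channel $\mathbf{h}_{SR,k}$ can be expressed as

$$\hat{\mathbf{h}}_{SR,k} = \mathbf{Q}_{SR,k}\mathbf{\Xi}_{SR,k}\mathbf{z}_{SR,k} \qquad (26)$$

where $\mathbf{Q}_{SR,k}$ is the covariance matrix of $\mathbf{h}_{SR,k}$, which can be expressed as $\mathbf{Q}_{SR,k} = \beta_{SR,k}\lambda_1(\tilde{\mathbf{C}}_{SR,k})\mathbf{P}_R^H\mathbf{C}_{SR,k}\mathbf{P}_R$. $\mathbf{z}_{SR,k}$ is given by $\mathbf{z}_{SR,k} = \mathbf{h}_{SR,k} + \frac{1}{\sqrt{E_T}\tau}\mathbf{P}_R^H\left(\mathbf{H}_{SR,k}\mathbf{T}_{SR,k} + \mathbf{R}_{SR,k} + \mathbf{N}_{SR,k}\right)\boldsymbol{\phi}^*$, and $\mathbf{\Xi}_{SR,k}$ is the inverse of the covariance matrix of $\mathbf{z}_{SR,k}$.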

$^4$Direct estimation of $\mathbf{H}_{SR,k}$ and $\mathbf{H}_{RD,k}$ ($k = 1, \ldots, K$) requires a total pilot length of no less than $K(N_S + N_D)$. This results in a long training phase that degrades the SE greatly when $N_S$ and $N_D$ are large.


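The real effective channel can be decomposed as $\mathbf{h}_{SR,k} = \hat{\mathbf{h}}_{SR,k} + \Delta\mathbf{h}_{SR,k}$, with $\Delta\mathbf{h}_{SR,k}$ denoting the estimation error. $\hat{\mathbf{h}}_{SR,k}$ and $\Delta\mathbf{h}_{SR,k}$ are uncorrelated, and their covariance matrices can be expressed as $\hat{\mathbf{Q}}_{SR,k} = \mathbf{Q}_{SR,k}\mathbf{\Xi}_{SR,k}\mathbf{Q}_{SR,k}$ and $\mathbf{Q}_{SR,k} - \hat{\mathbf{Q}}_{SR,k}$, respectively.

Proof: A sketch of the proof is presented in Appendix-D. ■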
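The structure of Theorem 3 can be checked on a scalar analogue. The sketch below assumes a single observation $z = h + n$, where $h$ has variance $q$ and the lumped noise $n$ (distortion plus AWGN) is uncorrelated with $h$ and has variance `noise_var`. This is a simplification made only for illustration; the names are hypothetical.

```python
def lmmse_scalar(q, noise_var):
    """Scalar LMMSE estimate h_hat = q * xi * z with xi = 1 / var(z).

    Returns the estimator gain, the variance of the estimate (q * xi * q)
    and the variance of the estimation error (q - q * xi * q).
    """
    xi = 1.0 / (q + noise_var)   # inverse of the variance of z = h + n
    gain = q * xi
    est_var = q * xi * q
    err_var = q - est_var
    return gain, est_var, err_var

gain, est_var, err_var = lmmse_scalar(q=2.0, noise_var=0.5)
```

The two variances add up to $q$, mirroring the decomposition of the effective channel into an estimate and an uncorrelated error.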

Different from the case with ideal hardware [23], the LMMSE estimator is not equivalent to the MMSE estimator, since the received pilot signal is corrupted by the term $\mathbf{P}_R^H\mathbf{H}_{SR,k}\mathbf{T}_{SR,k}$ (due to the transmit imperfection of $S_k$), which is not independent of the channel to be estimated. There might exist a non-linear estimator that results in a smaller MSE. However, the difference should be small since the distortion noises are relatively weak.

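The LMMSE estimator of $\mathbf{h}_{RD,k}$ can be obtained using the same approach. For further analysis, we define $\mathbf{Q}_{RD,k}$ and $\hat{\mathbf{h}}_{RD,k}$ as the covariance matrix and the LMMSE estimate of $\mathbf{h}_{RD,k}$, respectively. Moreover, we define $\hat{\mathbf{Q}}_{RD,k} = \mathbf{Q}_{RD,k}\mathbf{\Xi}_{RD,k}\mathbf{Q}_{RD,k}$ as the covariance matrix of $\hat{\mathbf{h}}_{RD,k}$, where $\mathbf{\Xi}_{RD,k}$ is the counterpart of $\mathbf{\Xi}_{SR,k}$ for the $R \to D_k$ channel.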


C. Achievable Rate Analysis 

This subsection analyzes the achievable rate of HIA-MM-FDR. The achievable rate expressions are derived based on the bounding technique in [34].

1) Achievable Rate of $S_k \to R$ Channel: By treating $\sqrt{E_{S,k}}\,\mathrm{E}[\mathbf{w}_{R,k}^H\bar{\mathbf{H}}_{SR,k}\mathbf{p}_{S,k}]s_k[u]$ as the desired signal of the $S_k \to R$ channel, and approximating the remaining terms of the received signal, i.e., $y_{R,k}[u] - \sqrt{E_{S,k}}\,\mathrm{E}[\mathbf{w}_{R,k}^H\bar{\mathbf{H}}_{SR,k}\mathbf{p}_{S,k}]s_k[u]$, by worst-case uncorrelated additive Gaussian noise with the same variance, the rate in (29) is achievable on the $S_k \to R$ channel, where $EI_k$, $MUI_{R,k}$, $D^S_{R,k}$ and $D^R_{R,k}$ denote the EI, the MUI, the effective distortion due to the transmit imperfection of sources and the effective distortion due to the receive imperfection of the relay, respectively, which are given by


E\k = wf .Hb/ {WtArW^ + E [tfl H tf M]) 

= X) Es^j w^j^HsrjPsj 

i = t,j=jtk 

Dfi,fc = wf j^E [tr [u] rf [u]] 

( 30 ) 

Wherein, the BE matrices at the relay are given by 


WR = pRUsR{nsRUsRy 
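$$R_{SR,k}^{HIA} = \frac{T-2K\tau}{T}\log_2\!\left(1 + \frac{E_{S,k}\left|\mathrm{E}\!\left[\mathbf{w}_{R,k}^H\bar{\mathbf{H}}_{SR,k}\mathbf{p}_{S,k}\right]\right|^2}{E_{S,k}\,\mathrm{var}\!\left(\mathbf{w}_{R,k}^H\bar{\mathbf{H}}_{SR,k}\mathbf{p}_{S,k}\right) + \mathrm{E}[EI_k] + \mathrm{E}[MUI_{R,k}] + \mathrm{E}[D^S_{R,k}] + \mathrm{E}[D^R_{R,k}] + \mathrm{E}\!\left[\|\mathbf{w}_{R,k}\|^2\right]}\right) \qquad (29)$$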
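The rate in (29) is a pre-log factor times $\log_2(1+\mathrm{SINR})$, where the denominator of the SINR collects the signal variance, EI, MUI, the two distortion terms and the noise. The sketch below evaluates this structure for given term values. The second part is a toy illustration of the ceiling effect and rests on an assumption that is not taken from the theorems: the transmit distortion term is modeled as a fixed fraction `upsilon` of the desired signal power. All names are hypothetical.

```python
import math

def rate_sr(T, K, tau, signal, var_term, ei, mui, d_s, d_r, awgn):
    """Pre-log factor times log2(1 + SINR), following the structure of (29)."""
    prelog = (T - 2 * K * tau) / T
    sinr = signal / (var_term + ei + mui + d_s + d_r + awgn)
    return prelog * math.log2(1.0 + sinr)

# Interference-free example: SINR = 7, so the rate is (260/300) * 3 bits.
clean_rate = rate_sr(300, 10, 2, 7.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0)

def toy_ceiling_rate(signal, upsilon, T=300, K=10, tau=2):
    """Toy model (assumption): distortion power = upsilon * signal power."""
    return rate_sr(T, K, tau, signal, 0.0, 0.0, 0.0, upsilon * signal, 0.0, 1.0)

# As the signal power grows, the SINR saturates at 1 / upsilon.
ceiling = (260 / 300) * math.log2(1.0 + 1.0 / 0.04)
high_power_rate = toy_ceiling_rate(1e9, 0.04)
```

Under this toy model, raising the signal power cannot push the rate beyond the ceiling set by the distortion level, which is the behavior the HIA scheme is designed to relax.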


where H^ ^, • • • , (i G {SR, RD}) and Y is a di- 




-1 


agonal normalized matrix with [T]/,; = e;. 

Note that we have replaced the effective channels in the inner BF matrices with their estimates obtained in Theorem 3.

Theorem 4: Assume that the assumptions A1 and A2 hold, 
and Ar is selected so that Tr j.) = 0{Nfi). ^ With 
HIA scheme, the achievable rate of Sk ^ R channel in the 
large-(Afl, Nt) regime is given by ( [ 2 ^ , where the powers of 

desired signal and AWGN are Es^k 


Es^k and E 


|W_R,fc|l 


E 




R_k^SR,kPS,k 


— ({^^{QsR,k)) and 
As,fcvar {v^{{ f,UsR,kPs,k) = Es,k (Tr (QsR,k) ) 
E[EU] = 


SR 

k 


/^B/Tr (^Cei^r^ Tr 


(Tr(Cs^,,))^ 


E[MUI 


R,k. 


= E 


- QsR,j') ^SR,k 


(iy( 


^SR,k 


K 


vs,kEs,kgkk + Z) i^s,jEs,jgkj 

E[dLZ=—- 

(Tr [QsE,k 


E 


D 


Tr 


Tr (^CgE,k) 

E ^Sj/3sRjTr (^CsR,j^s,s 


J=i 


/^E/Tr ^Cei^r^ 


K 


^R — ^ Eg^, 


PtCee,i^t + ^fldiag (PtC^^, ;Py 


1=1 


Tr 




where $\delta_{SR,k}$ is defined through the estimation error $\Delta\mathbf{h}_{SR,k}$ and is given by Lemma 1 in Appendix-A. $g_{kk}$ and $g_{kj}$ are given by (61) and (62) in Appendix-E. $\mathbf{\Pi}_{S,k}$ is expressed as

$$\mathbf{\Pi}_{S,k} = \mathbf{p}_{S,k}\mathbf{p}_{S,k}^H + \upsilon_{S,k}\,\mathrm{diag}\!\left(\mathbf{p}_{S,k}\mathbf{p}_{S,k}^H\right) \qquad (33)$$

The scaling behaviors of the above factors are shown in Table I.

Proof: See Appendix-E. ■

Several important results on the effect of EI, MUI and hardware impairments on the achievable rate of the $S_k \to R$ channel can be obtained from Theorem 4 and Table I.

Effect of EI: With the proposed HIA scheme, the scaling behavior of EI is highly correlated with $\lambda_{N_R-A_R+1}(\mathbf{C}_{EI})$ and $\lambda_{N_T-A_T+1}(\tilde{\mathbf{C}}_{EI})$. With assumption A1, we have $\lambda_{N_R-A_R+1}(\mathbf{C}_{EI}) \le O(1)$ and $\lambda_{N_T-A_T+1}(\tilde{\mathbf{C}}_{EI}) \le O(1)$. In fact, under a specific spatial correlation model (e.g., the physical channel model with a fixed number of angular bins [11]), it is possible to make $\lambda_{N_R-A_R+1}(\mathbf{C}_{EI}) < O(1)$ and $\lambda_{N_T-A_T+1}(\tilde{\mathbf{C}}_{EI}) < O(1)$ by selecting proper $A_R$ and $A_T$. In this case, the power of EI can decrease faster than $O(KN_R^{-1})$ in the large-$(N_R, N_T)$ regime. Note that this is in contrast with the result in [12], which showed that the power of EI decreases exactly as $O(KN_R^{-1})$.

Effect of MUI: The scaling behavior of MUI agrees with the classic result for MM-FDR with perfect hardware [12], i.e., the MUI diminishes as $N_R \to \infty$ if $K \ll N_R$.

Effect of Transmit Imperfection of Sources: (i) When $N_S$ is fixed, an interesting observation is that the term $E_{S,k}\,\mathrm{var}(\mathbf{w}_{R,k}^H\bar{\mathbf{H}}_{SR,k}\mathbf{p}_{S,k})$, which is widely regarded as an $O(N_R^{-1})$ term with perfect hardware, scales as $O(1)$. As a result, the achievable rate of the $S_k \to R$ channel is limited by the joint effect of $E_{S,k}\,\mathrm{var}(\mathbf{w}_{R,k}^H\bar{\mathbf{H}}_{SR,k}\mathbf{p}_{S,k})$ and the effective distortion due to the transmit imperfection of sources, i.e., $D^S_{R,k}$. In fact, the change in the scaling behavior of $E_{S,k}\,\mathrm{var}(\mathbf{w}_{R,k}^H\bar{\mathbf{H}}_{SR,k}\mathbf{p}_{S,k})$ can also be viewed as a "negative effect" of the transmit imperfection of sources. To see this, inserting the expression of $\delta_{SR,k}$ (given by Lemma 1 in Appendix-A) into (32) and letting $N_R \to \infty$, we obtain an expression for $E_{S,k}\,\mathrm{var}(\mathbf{w}_{R,k}^H\bar{\mathbf{H}}_{SR,k}\mathbf{p}_{S,k})$ whose leading term is proportional to $\mathbf{p}_{S,k}^H\mathrm{diag}(\mathbf{p}_{S,k}\mathbf{p}_{S,k}^H)\mathbf{p}_{S,k}$, plus an $O(N_R^{-1})$ remainder. This demonstrates that $E_{S,k}\,\mathrm{var}(\mathbf{w}_{R,k}^H\bar{\mathbf{H}}_{SR,k}\mathbf{p}_{S,k})$ scales as $O(1)$ as long as $\upsilon_{S,k} > 0$. (ii) When $N_S$ scales with $N_R$ and $K \ll N_R$, it is seen that the proposed HIA scheme reduces the effect of the transmit imperfection at sources. The reason is that the transmit distortion is pre-suppressed by the transmit BF vector $\mathbf{p}_{S,k}$.

$^5$By increasing $A_R$, this can always be achieved, since the trace approaches $N_R$ as $A_R$ increases, from assumption A2.

Effect of Receive Imperfection of the Relay: By comparing the result in Table I with (15), it is seen that the HIA scheme does not change the scaling behavior of the distortion noise caused by the receive imperfection of the relay. However, the receive distortion noise power is still reduced, since EI (which is a main contributor to the receive distortion noise) is suppressed.

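2) Achievable Rate of $R \to D_k$ Channel: Following the bounding technique in [34], the achievable rate of the $R \to D_k$ channel can be expressed as

$$R_{RD,k}^{HIA} = \frac{T-2K\tau}{T}\log_2\!\left(1 + \frac{E_{R,k}\left|\mathrm{E}\!\left[\mathbf{p}_{D,k}^H\bar{\mathbf{H}}_{RD,k}\mathbf{w}_{T,k}\right]\right|^2}{E_{R,k}\,\mathrm{var}\!\left(\mathbf{p}_{D,k}^H\bar{\mathbf{H}}_{RD,k}\mathbf{w}_{T,k}\right) + \mathrm{E}[MUI_{D,k}] + \mathrm{E}[D^R_{D,k}] + D^D_{D,k} + 1}\right) \qquad (34)$$

where $MUI_{D,k}$, $D^R_{D,k}$ and $D^D_{D,k}$ denote the MUI, effective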
TABLE I: Scaling behaviors of different factors in Theorem 4



Desired 

Signal 

AWGN 


E[EIJ 

B[IVIUI„J 

e[d;,] 

e[d“,,] 

Perfect 

Hardware 

0(1) 

o(v»') 



0(A»-) 

0 

0 

Imperfect 

Hardware 

0(1) 

oK) 

fixed 

scales with 


0(A.') 

(!?(l),A 5 is fixed 

.0(max{w,T‘, 

scales with N/, 

o(kn-;) 


TABLE II: Scaling behaviors of different factors in Theorem 5



Desired 

Signal 

AWGN 

^R,k 

I5[MUI„,J 

E[DL,J 

eK,-] 

Perfect 

Hardware 

0{N) 

0(1) 

0(1) 

o{n-') 

0 

0 

Imperfect 

Hardware 

0(N) 

0(1) 

0{N^),N^\s fixed 

0\ M scales with A. 

[ UJ 

o{n-') 

0(K) 


0(Ar),A^is fixed 

scales with Nj 


distortion due to the transmit imperfection of the relay and effective distortion due to the receive imperfection of destinations, respectively, which are given by


'PD.k^RD,k'^T,j 

■-.H riH 

^RD,k 

sR 


^D,k — PD,k^RD,k®]i^RD,k'PD,k 
= PD,k®D.kPD,k 


(35) 


Theorem 5: Assume that assumptions A1 and A2 hold, and 
At is selected so that Tr (C^^, j.) = 0 {Nt)- With HI A 
scheme, the achievable rate of Dk channel in the large- 
Nt) regime is given by ( [M| ), where the power of desired 

signal is Er^i, E[pg ^.), and 


RD 

k 


Er^u var (pf fcWT.fc) = ^Tr (Qrr j^^ ) Er^^S, 

^ ^-RjTr ^ (^Crr,— Crrj.'^ ^RDj 

E[MUb.fe]= 5] - 


Tr 


(CflDj) 


K 


E 


D 


D,k 


= vrY, 


Er^iEy ^Ci^£)fcdiag (^tQrd,i^t 


1=1 


P,R _ fJ-D,kER^k 
^D.k — 


Tr 


(QRD,k^ 


{Xi{CRD,k)) Tt{Crr^i) 
'^Pc.fcdiag {pD,kPD,k) PD,k 


V 


^Tr (^Crrj. 


-2 




H-K I _J- _I 

^ I ' 


Et JIhEt 


■ PRD,k ^Ai {Crd,^ + VD,k^ 




-1 


Ai 


{^RD,l^ 


Proof: The proof is similar to that of Theorem 4 and is thus omitted. ■

From Theorem 5 and Table II, one can observe a strong similarity between the effects of hardware impairments on the $R \to D_k$ channel and those on the $S_k \to R$ channel. In particular, the effective distortion due to the transmit imperfection of the relay, $D^R_{D,k}$, is suppressed by the large transmit array of the relay, and it diminishes as $N_T \to \infty$ with $K \ll N_T$. Moreover, when $N_D$ is fixed, the achievable rate of the $R \to D_k$ channel is limited by the term $E_{R,k}\,\mathrm{var}(\mathbf{p}_{D,k}^H\bar{\mathbf{H}}_{RD,k}\mathbf{w}_{T,k})$ and the distortion noise caused by the receive imperfection of $D_k$. However, as $N_D$ scales with $N_T$ and $K \ll N_T$, these factors are suppressed. This is because applying $\mathbf{p}_{D,k}$ can be viewed as a post-suppression of the receive distortion noise.

With the HIA scheme, the following end-to-end rate for the $S_k \to R \to D_k$ channel is achievable in the large-$(N_R, N_T)$ regime


ER,jl 3 RR,j f ^ ^ pi/A I 

it y^RD^kPT^RRjPR I -b 


- E 

i=i 


E 


R,k 


ER,k 

(36) 


Here, $\delta_{RD,k}$ is defined through the estimation error $\Delta\mathbf{h}_{RD,k}$ and is given by Lemma 1 in Appendix-A. The scaling behaviors of the above factors are shown in Table II.

$$R_k^{HIA} = \min\left\{R_{SR,k}^{HIA},\ R_{RD,k}^{HIA}\right\} \qquad (37)$$

Different from the upper bound in the last section, $R_k^{HIA}$ cannot be expressed in a simple form. However, based on Table I and Table II, one can deduce that the end-to-end achievable rate converges to a finite ceiling if $N_S$ and $N_D$ are fixed. With the HIA scheme, in contrast, the end-to-end achievable rate grows without bound as $\min\{N_R, N_T\} \to \infty$ if $N_S$ and $N_D$ scale with $\min\{N_R, N_T\}$ and $K \ll \min\{N_R, N_T\}$. This is the same as the situation with ideal hardware.
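The end-to-end rate in (37) is the smaller of the two per-hop rates, so the weaker hop of each S-D pair determines its throughput. A minimal sketch with hypothetical per-hop rate values:

```python
def end_to_end_rates(r_sr, r_rd):
    """Per-pair end-to-end rate: the weaker of the two hops, as in (37)."""
    return [min(a, b) for a, b in zip(r_sr, r_rd)]

def bottleneck_hops(r_sr, r_rd):
    """Label the limiting hop of each S-D pair."""
    return ["S->R" if a <= b else "R->D" for a, b in zip(r_sr, r_rd)]

# Hypothetical per-hop rates in bit/s/Hz for two S-D pairs.
r_sr = [3.0, 1.5]
r_rd = [2.0, 4.0]
rates = end_to_end_rates(r_sr, r_rd)
sum_se = sum(rates)                  # SE: sum of all destinations' rates
limits = bottleneck_hops(r_sr, r_rd)
```

Since only the smaller rate counts, power spent on the stronger hop beyond the point where both hops are balanced does not raise the SE, which is one motivation for optimizing the powers of both hops jointly.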

D. Discussion on Hardware Design 

Based on the achievable rate expressions, we discuss the hardware design of sources, destinations and the relay in this subsection. Although the achievable rate expressions hold for arbitrary $K < \min\{N_R, N_T\}$, we restrict our analysis to $K \ll \min\{N_R, N_T\}$. This is a fundamental condition to ensure the benefit of massive MIMO, i.e., the MUI is suppressed by a large surplus of degrees of freedom [1]. In MM-FDR with hardware impairments, it is also essential to control the number of S-D pairs (e.g., by some user scheduling scheme) to suppress the EI and distortions.
1) Hardware Design of Sources and Destinations: When $N_S$ and $N_D$ are fixed and small (due to, e.g., size limitations), Theorems 4 and 5 show that it is of great importance to reduce $\upsilon_{S,k}$, $\upsilon_{D,k}$ and $\mu_{D,k}$ in order to alleviate the "ceiling effect" on the achievable rate. This indicates that it is beneficial to use high-quality hardware at sources and destinations in this case. Note that, different from the relay side, the use of high-quality hardware at sources and destinations is affordable, since the increased cost scales with $N_S$ and $N_D$ instead of $\min\{N_R, N_T\}$.

When $N_S$ and $N_D$ scale with $\min\{N_R, N_T\}$, Theorems 4 and 5 show that the distortion noise powers due to hardware impairments at sources and destinations are lower order terms compared with the desired signal powers. This means that the sources and destinations can decrease the hardware quality to some extent without degrading the performance. An example of how to achieve this is given later.

2) Hardware Design of Relay: Similar to the case of sources and destinations when $N_S$ and $N_D$ scale with $\min\{N_R, N_T\}$, the distortion noises caused by hardware impairments at the relay are lower order terms compared with the desired signal powers, which makes it possible to decrease the hardware quality without hurting the performance greatly.

Example: To get a clear insight, we consider the achievable rate of the $S_k \to R$ channel$^6$ given by Theorem 4. We assume that there is only one S-D pair and let $\upsilon_R = \mu_R$. When $N_S$ scales with $N_R$, the achievable rate of the $S_k \to R$ channel reduces to

$$R_{SR,k}^{HIA} = \frac{T-2\tau}{T}\log_2\!\left(1 + \frac{1}{\frac{c_1\upsilon_{S,1} + c_2\mu_R}{N_S} + o(N_S^{-1})}\right) \qquad (38)$$

where $c_1$ and $c_2$ are positive constants independent of $\upsilon_{S,1}$, $\mu_R$ and $\{N_S, N_R\}$. In (38), we have replaced $N_R$ with the product of $N_S$ and some constant. Assume we wish to achieve rate $R_T$. According to (38), we have (as $N_S \to \infty$)

$$c_1\upsilon_{S,1} + c_2\mu_R = \left(\frac{1}{2^{\frac{TR_T}{T-2\tau}} - 1} - o(N_S^{-1})\right)N_S \approx \frac{N_S}{2^{\frac{TR_T}{T-2\tau}} - 1}.$$

The result is encouraging, since it implies that we can increase $\upsilon_{S,1}$ and $\mu_R$ linearly as $N_S$ increases (with $N_R$ scaling with $N_S$) without degrading the achievable rate. As a result, inexpensive MM-FDR is possible.
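The linear trade-off in the example can be checked numerically. The sketch below assumes that the large-$N_S$ rate has the form $\frac{T-2\tau}{T}\log_2\left(1 + N_S/(c_1\upsilon_{S,1} + c_2\mu_R)\right)$, i.e., the lower order term is dropped, and treats the weighted impairment level $c_1\upsilon_{S,1} + c_2\mu_R$ as a single budget. Both the simplified form and the function names are assumptions made for illustration.

```python
import math

def impairment_budget(Ns, R_target, T, tau):
    """Largest weighted impairment level that still supports R_target.

    Assumption: rate = (T - 2*tau)/T * log2(1 + Ns / budget).
    """
    gamma = 2.0 ** (R_target * T / (T - 2 * tau)) - 1.0   # required SINR
    return Ns / gamma

def rate_from_budget(Ns, budget, T, tau):
    """Rate under the same simplified form, for a given impairment budget."""
    return (T - 2 * tau) / T * math.log2(1.0 + Ns / budget)

b10 = impairment_budget(10, 3.0, 300, 2)
b20 = impairment_budget(20, 3.0, 300, 2)
```

Doubling $N_S$ doubles the tolerable impairment budget at the same target rate, which is the linear relation stated in the example.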


VI. Joint DOF and Power Optimization

In this section, we propose a low-complexity JDPO algorithm to maximize the SE (defined as the sum of all destinations' achievable rates) of HIA-MM-FDR, subject to maximum power constraints. The achievable rate expressions obtained in Section V-C are utilized in the proposed algorithm. The algorithm needs only statistical knowledge of channels. Therefore, the solution can be computed offline at a central node (e.g., the relay) and then broadcast to the other nodes.

Let $E_{S,k}^{\max}$ and $E_R^{\max}$ be the maximum transmit power constraints at $S_k$ and the relay, respectively. The SE optimization

$^6$We consider the achievable rate of the $S_k \to R$ channel to simplify the analysis. A similar result can be obtained based on the achievable rate of the $R \to D_k$ channel in Theorem 5. Therefore, the analysis is also valid for the end-to-end achievable rate.


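problem can be formulated as follows

$$\max_{E_{S,1},\ldots,E_{S,K};\ E_{R,1},\ldots,E_{R,K};\ A_R,\ A_T}\ \sum_{k=1}^{K} R_k^{HIA}$$

$$\text{s.t.}\quad 0 \le E_{S,k} \le E_{S,k}^{\max},\ k = 1, \ldots, K;\qquad \sum_{k=1}^{K} E_{R,k} \le E_R^{\max};\qquad A_R \in \mathcal{A}_R,\ A_T \in \mathcal{A}_T \qquad (39)$$

where $\mathcal{A}_R = \{K, K+1, \ldots, N_R\}$ and $\mathcal{A}_T = \{K, K+1, \ldots, N_T\}$.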

According to Theorem 4 and Theorem 5, we rewrite $R_{SR,k}^{HIA}$ and $R_{RD,k}^{HIA}$ as $R_{SR,k}^{HIA} = \frac{T-2K\tau}{T}\log_2(1 + \gamma_{SR,k})$ and $R_{RD,k}^{HIA} = \frac{T-2K\tau}{T}\log_2(1 + \gamma_{RD,k})$, where $\gamma_{SR,k}$ and $\gamma_{RD,k}$ denote the effective received SINRs of the $S_k \to R$ and $R \to D_k$ channels, respectively, which can be expressed as

Es,k 


TSfi.fc = —^ 


"(RD,k — 


E'i. + EU 

Tr {Cunf E, 




^k,j^R,j + RD,k + 1 


(40) 


where 


s / -h i^s,k9kj 

o-kj = - —t:-+ rrpsr,3 


Tr 


(Tr( 


S^SR,k 


{CsR,j^S,j 


^3 = 


Z^B/Tr (^Ei {^T^sD,3 ^t 4" E'sdiag 


Tr 


Tr j) 


'Tr(^Css,kHCEiPR 


Tr 


f (O-SR.k^ 


RR 


bk3 = 


SRD RrT'^ (CsD,k<iia.g (pTCsD,kPT 


Tr 


{CsD,k) {x3[CsD,k)) 'Tr{CsD,k) 

FD,fePB,fediag (pD.fepg.fc) PD.k RD.kRRjr {QRD,kPRD,k 


E'j' 


(Tp (c^D.fc)) 

I- PRD,k 


-1 


'RD,k 


tXi (^CsD,k^ Tr (QsD,k 


RD,k 


EtPr 


A = k 


Tr (Qsdj) 


vr 


Tr ^Ci^D,fediag {PtQsdjP 


{CsD,k 

RD.kPRD.Ar I \J,SD.kPTC 


-1 


Tr 


,fe/?itD,yTr {CsD,kPTCsD,3PT 


Tr 




3 


With the above results and the identity $\sum_{k=1}^{K}\frac{T-2K\tau}{T}\log_2(1 + \gamma_k) = \frac{T-2K\tau}{T}\log_2\!\left(\prod_{k=1}^{K}(1 + \gamma_k)\right)$ (where $\gamma_k$ is defined as $\gamma_k = \min\{\gamma_{SR,k}, \gamma_{RD,k}\}$), (39) can be rewritten as


K 

min n (1 +7fc) 

( n . ^5,fe 

^ - 


-1 


j=i j=i 


> Ik, 


s.t. < 


k=l,2,--- ,K 

Tr(C^j^ ^ 1 o 

^2 ■ v^K c-l ■ I , I 1 > 'lk,K = 1,2, - ■ ■ ,K 


,K 


C3:Es,k<E^X-,k= 1,2, 

C 4 ■ E ^R,j — 
i=i 

, Cs : Ar g Ar, At G At 

(41) 

By applying some algebraic manipulations to the inequality constraints, C1 and C2 can be rewritten as


■ E ^k,i^S,k^S,jlk + E ^k,j^S,k^R,nk 

i=i i=i 

+ ik-R + 1) (^Tr (CsR,k'j'^ < 1 

^2 : E ^k,j^Rlk^R,jlk + ikD,k + 1) -£'K,fe7fc < Tr (Qrd^^ 

(42) 

Problem (41) is a combinatorial optimization problem, which is in general NP-hard. To solve (41), we propose a JDPO algorithm to find a suboptimal solution. Our strategy is as follows. First, using an approach similar to that in [15], we show that the power control problem with fixed $A_R$ and $A_T$ can be approximated as a geometric program (GP) and solved efficiently using convex optimization tools. Then we present a heuristic approach based on sequential optimization to determine $A_R$ and $A_T$.

Power Control with Fixed $A_R$ and $A_T$: From (41) and (42), the inequality constraints are posynomial functions [36]. As shown in [15], for any $\gamma_k > 0$, $1 + \gamma_k$ can be approximated by a monomial function $\theta_k\gamma_k^{\omega_k}$ near a point $\hat{\gamma}_k$, where $\omega_k = \hat{\gamma}_k/(1 + \hat{\gamma}_k)$ and $\theta_k = \hat{\gamma}_k^{-\omega_k}(1 + \hat{\gamma}_k)$. By using this, the target function of (41) can be approximated by a monomial function, $\prod_{k=1}^{K}\theta_k^{-1}\gamma_k^{-\omega_k}$. As a result, the solution of the power control problem (with fixed $A_R$ and $A_T$) can be obtained by solving several GPs. The algorithm is summarized in Algorithm 1.
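The monomial approximation used here matches $1+\gamma$ in value and in log-log slope at the expansion point. A minimal sketch in standard-library Python (the function names are hypothetical):

```python
def monomial_fit(gamma_hat):
    """Coefficients (theta, omega) of the monomial theta * gamma**omega
    that touches 1 + gamma at gamma = gamma_hat."""
    omega = gamma_hat / (1.0 + gamma_hat)
    theta = (1.0 + gamma_hat) / gamma_hat ** omega
    return theta, omega

def monomial(gamma, theta, omega):
    """Evaluate the monomial approximation at gamma."""
    return theta * gamma ** omega

theta, omega = monomial_fit(3.0)
value_at_point = monomial(3.0, theta, omega)   # equals 1 + 3 at the expansion point
```

Because $\log(1+e^x)$ is convex in $x$, the monomial is a tangent line in the log-log domain and never exceeds $1+\gamma$. Maximizing the approximated product therefore maximizes a lower bound on the true objective, which is the usual justification for re-fitting the coefficients around each new iterate.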
Optimization of $A_R$ and $A_T$: An obvious approach to obtain $A_R$ and $A_T$ is a 2-D search. However, the complexity is very high, since the power control problem described above must be solved for each pair $(A_R, A_T)$. In this work, we present a heuristic approach to find a suboptimal solution. In this approach, $A_R$ ($A_T$) is searched over a subset $\bar{\mathcal{A}}_R$ ($\bar{\mathcal{A}}_T$), whose elements are sampled from $\mathcal{A}_R$ ($\mathcal{A}_T$). For example, $\bar{\mathcal{A}}_R$ can be selected as the multiples of $K$ in $\mathcal{A}_R$, i.e., $\{K, 2K, \ldots\}$. The motivation is that, as observed from the numerical results, the target function does not change dramatically when $A_R$ and $A_T$ increase or decrease by a small step. Moreover, instead of a 2-D search, $A_R$ and $A_T$ are optimized sequentially over the sampled subsets $\bar{\mathcal{A}}_R$ and $\bar{\mathcal{A}}_T$ via 1-D searches. The process repeats several times (much fewer than $\min\{|\bar{\mathcal{A}}_R|, |\bar{\mathcal{A}}_T|\}$) to improve the solution.

A summary of the JDPO algorithm is presented in Algorithm 2. The algorithm converges to a local optimum since the

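Algorithm 1 Solve the power control problem with fixed $A_R$ and $A_T$ by GP.

1: Initialization: Let $\gamma_k^{(i)}$ denote the solution of $\gamma_k$ after the $i$th iteration. Compute the initial value $\gamma_k^{(0)}$ using (40). Set a tolerance $\epsilon$ and the maximum number of iterations $L_{GP}$.

2: For the $(i+1)$th iteration:

Compute $\omega_k = \gamma_k^{(i)}/(1 + \gamma_k^{(i)})$ and $\theta_k = (\gamma_k^{(i)})^{-\omega_k}(1 + \gamma_k^{(i)})$.

Solve the following GP problem

$$\min_{E_{S,1},\ldots,E_{S,K};\ E_{R,1},\ldots,E_{R,K};\ \{\gamma_k\}_{k=1}^{K}}\ \prod_{k=1}^{K}\theta_k^{-1}\gamma_k^{-\omega_k}\quad \text{s.t. } C_1 \sim C_5$$

3: If $\max_{k=1,\ldots,K}\left|\gamma_k^{(i+1)} - \gamma_k^{(i)}\right| < \epsilon$ or $i + 1 = L_{GP}$, stop. Otherwise, set $i = i + 1$ and go back to step 2.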


Algorithm 2 JDPO algorithm for SE optimization.

1: Initialization: Set an initial $A_T$, i.e., $A_T^{(0)}$, and a maximum number of repeats $L$. Select the subsets $\bar{\mathcal{A}}_R$ and $\bar{\mathcal{A}}_T$.

2: for all $l = 1, \ldots, L$ do

3: Set $A_T = A_T^{(l-1)}$ and compute the SE using Algorithm 1 for all $A_R \in \bar{\mathcal{A}}_R$.

4: Update $A_R^{(l)} = \arg\max_{A_R} \mathrm{SE}$.

5: Set $A_R = A_R^{(l)}$ and compute the SE using Algorithm 1 for all $A_T \in \bar{\mathcal{A}}_T$.

6: Update $A_T^{(l)} = \arg\max_{A_T} \mathrm{SE}$.

7: end for
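The outer loop of Algorithm 2 is an alternating 1-D search over two sampled grids. The sketch below shows only that loop. The SE evaluation, which in the paper is obtained by solving the GP of Algorithm 1, is replaced by a hypothetical callable `se`, and the toy objective used in the example is an assumption for illustration.

```python
def sampled_subset(K, N):
    """Coarse grid: the multiples of K in {K, ..., N}."""
    return list(range(K, N + 1, K))

def jdpo_search(se, grid_r, grid_t, a_t_init, repeats):
    """Alternate 1-D searches over A_R and A_T for a fixed number of repeats.

    `se(a_r, a_t)` stands in for the SE returned by the power control step.
    """
    a_r, a_t = grid_r[0], a_t_init
    for _ in range(repeats):
        a_r = max(grid_r, key=lambda a: se(a, a_t))   # steps 3-4
        a_t = max(grid_t, key=lambda a: se(a_r, a))   # steps 5-6
    return a_r, a_t, se(a_r, a_t)

def toy_se(a_r, a_t):
    """Toy objective with its maximum at (60, 40); not a rate expression."""
    return -(a_r - 60) ** 2 - (a_t - 40) ** 2

grid_r = sampled_subset(10, 100)
grid_t = sampled_subset(10, 100)
best = jdpo_search(toy_se, grid_r, grid_t, a_t_init=10, repeats=3)
```

Each repeat evaluates the objective on `len(grid_r) + len(grid_t)` grid points, compared with `len(grid_r) * len(grid_t)` for an exhaustive 2-D search over the same grids.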


target function is improved in each iteration. The complexity of Algorithm 2 is upper bounded by $L(|\bar{\mathcal{A}}_R| + |\bar{\mathcal{A}}_T|)L_{GP}C_{GP}$, where $C_{GP}$ is the complexity of solving the GP when $A_R = N_R$ and $A_T = N_T$. Usually, a GP is solved using the interior point method in polynomial time. The exact expression of $C_{GP}$ is difficult to obtain and depends on the structure of the problem. Some insights on $C_{GP}$ can be found in [37, Sec. 11.5].

Remark 2: (JDPO for EE Optimization) Instead of the SE optimization, we can also formulate an EE (defined as the SE divided by the total transmit power [12]) optimization problem subject to a target SE $R_T$


T min{R; 


HIA rHIA 
SR,k’'^RD 


4 


max 


Es,ic-- ^Es,k',Er^i,--- ^Er^k^Ar^At ^ -E's fe+IZ Er f 




S.t. < 


0 < Es,k < E^l^, k = l,- 


,K 


K 

E ta TT'max 

^R,k S 




(43) 


Using (|40|i, the above 


Ps,l 




max 

Z,Er^\,--- 


,Er^k',Ar,At 


problem is 

Ek=i ^s,k 


equivalent to 

Ek=i J s.t. 


(Ci-Cs and nf=i(l+7fc) = 2r^^^. With the 

technique in [35], the equality constraint can be converted to a monomial, and the power control problem (with fixed $A_R$ and $A_T$) becomes a GP. Thus, (43) can be solved by using a JDPO algorithm similar to Algorithm 2.


VII. Simulation Results and Discussion 


This section presents simulation results to verify the analyses in the previous sections. Throughout this section, we set $\upsilon_{S,k} = \upsilon_S$, $\upsilon_{D,k} = \upsilon_D$ and $\mu_{D,k} = \mu_D$ for convenience. The correlation matrices of the desired channels are generated with the exponential correlation model, in which each entry of $\mathbf{C}_{i,k}$ or $\tilde{\mathbf{C}}_{i,k}$ is given by

$$[\mathbf{C}_{i,k}]_{m,n} = \begin{cases} r_{i,k}^{\,n-m}, & m \le n \\ \left(r_{i,k}^{\,m-n}\right)^*, & m > n \end{cases},\qquad i \in \{SR, RD\}$$

The model approximates the property of a ULA, where the correlation between adjacent antennas is $|r_{i,k}| \in [0, 1]$ and the phase of $r_{i,k}$ describes the angle of arrival/departure as seen from the array. [39] shows how to map some of the parameters of a ULA to this model. The correlation matrices of the EI channel, $\mathbf{C}_{EI}$ and $\tilde{\mathbf{C}}_{EI}$, are generated similarly with a parameter $r_{EI}$. For convenience, we let $|r_{i,k}| = r_0$, $\forall k \in \{1, \ldots, K\}$. The phases of $r_{i,k}$ and $r_{EI}$ are uniformly selected from $[0, \pi]$.
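As a minimal sketch of how such a correlation matrix can be generated, the following standard-library Python code builds the exponential correlation matrix for a complex coefficient $r$ with a given magnitude and phase. The function name and the example values (other than the magnitude $r_0 = 0.2$ used in the simulations) are illustrative assumptions.

```python
import cmath

def exp_corr(n, r):
    """n x n exponential correlation matrix:
    entry (m, l) is r**(l - m) for m <= l and conj(r**(m - l)) otherwise."""
    C = [[0j] * n for _ in range(n)]
    for m in range(n):
        for l in range(n):
            if m <= l:
                C[m][l] = r ** (l - m)
            else:
                C[m][l] = (r ** (m - l)).conjugate()
    return C

# Magnitude 0.2 as in the simulations; the phase 1.0 rad is an arbitrary example
# (the simulations draw it uniformly from [0, pi]).
r = 0.2 * cmath.exp(1j * 1.0)
C = exp_corr(4, r)
```

The resulting matrix is Hermitian with a unit diagonal, and the magnitude of its entries decays geometrically with the antenna separation.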

We assume that 25~35dB El cancellation can be pro¬ 
vided by passive El suppression techniques (more than 40dB 
cancellation has been reported by using such techniques for 
infrastructure node |0|). The variances of El channel (after 
passive cancellation) and desired channels are selected as 
Pei/ Pi,k G [0, 25]dB. With the path loss model in ID, the 
above range corresponds to the setup with the distances from 
sources/destinations to relay varying from 250m to 500m and 
10m segregation between relay transmit and receive arrays. 


A. Impact of Hardware Impairments 

This subsection considers the effect of hardware impairments on the SE of MM-FDR. The channel coherence time is set to $T = 300$ and the length of the pilot sequence is $\tau = 2$.

Fig. 1 shows the SE of MM-FDR with a single antenna at sources/destinations under different levels of hardware impairments. The SEs based on the transceiver scheme in Section IV (which achieves the upper bound when $N_S = N_D = 1$) and the HIA scheme (with $\mathbf{p}_{S,k} = \mathbf{p}_{D,k} = 1$) are simulated. From Fig. 1, the SE is more sensitive to hardware impairments at sources and destinations. When $\upsilon_S = \upsilon_D = \mu_D = 0.2^2$, the SE approaches a finite ceiling quickly as the number of relay antennas increases. Similar results can be observed in Fig. 2 when sources and destinations are equipped with multiple but fixed numbers of antennas. However, the result changes when $N_S$ and $N_D$ scale with $\min\{N_R, N_T\}$ and the HIA scheme is used. From Fig. 2, when $N_S$ and $N_D$ grow with the number of relay antennas, similar SEs are achieved whether the sources/destinations or the relay employ low-quality hardware, and no performance ceiling appears. This demonstrates the validity of the HIA scheme. Finally, it is seen that the asymptotic results in Theorems 1 & 2 and Theorems 4 & 5 match well with the exact results.

As the performance is affected by impairments of both the transmit and receive RF chains at the relay, we compare the effect of transmit and receive imperfections on the SE in Fig. 3.

We assume that the sources and destinations use high-quality

[Fig. 1 plot: SE versus number of relay antennas ($N_R = N_T$). Curves: Upper Bound (Exact), Upper Bound (Theorem 1 & 2), HIA-MM-FDR (Exact), HIA-MM-FDR (Theorem 4 & 5). Annotations: perfect hardware; ceiling appears as sources/destinations use low-quality hardware.]

Fig. 1: SE of traditional MM-FDR with single antenna at 
sources/destinations, where Ns = Ne = 1, K = 10, 
Es,k = Eji^k = 8dB, Et = lOdB, PsR,k = PRD,k = 
Pei = 1, To = 0.2, \rEi\ = 0.8, Ar = max{A:, 

At = max{Ar, [| }■ 


[Fig. 2 plot: SE versus number of relay antennas ($N_R = N_T$). Curves: HIA-MM-FDR ($N_S = N_D = 3$, Theorem 4 & 5), HIA-MM-FDR ($N_S = \lfloor N_R/K \rfloor$, $N_D = \lfloor N_T/K \rfloor$, Theorem 4 & 5), Exact. Annotations: perfect hardware; similar SEs are obtained as S, D or R uses low-quality hardware; ceiling disappears as $N_S$, $N_D$ scale with $\min\{N_R, N_T\}$.]

Fig. 2: SE of MM-FDR with multiple antennas at sources/destinations. The setup is the same as in Fig. 1.


hardware ($\upsilon_S = \upsilon_D = \mu_D = 0.01^2$). From the figure, the effect of the receive imperfection is more detrimental. The reason is that, with EI at the relay, the power of the distortion noise caused by the receive imperfection is much stronger than that due to the transmit imperfection. However, the performance difference decreases as $N_R$ increases from $0.5N_T$ to $2N_T$, since the power of the effective receive distortion scales as $O(KN_R^{-1})$ (see Table I). This implies that using relatively higher-quality hardware or more antennas at the receive side of the relay is beneficial. Using a similar setup, one can obtain a parallel conclusion for the destination, i.e., the receive imperfection is more harmful than the transmit imperfection. This is because the transmit imperfection of the destination only induces larger channel estimation errors, which form one part of the received signal at the destination. Thus, based on the receive distortion model, the distortion noise due to the receive imperfection is more detrimental.
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Fig. 3; Comparison between transmit/receive imperfections at 
the relay, where vs = = 0.01^, K — 10, Es^k = 

En^k = 5dB, Et = lOdB, l3sR,k = PRD,k = 1, f^Ei = 
5dB, ro = 0.2, |r_E/| = 0.8, Ar = max{itr, At = 

max{if, [INt\}, Ns = [^J, = [^J. 



Fig. 4: SE v.s. number of S-D pair K, where vs = = 

vr = v, fj.T) = fiR = fj,, Es^k = ER,k = 5dB, Et = lOdB, 
PsR.k = l3RD,k = 1 , Pei = 5dB, ro = 0 . 4 , \rEi\ = 0 . 7 , 
Nr = Nt = 200 , Ns = Nd = 10 , An = max{K, }, 

At = rnaxlFf, [| }■ 

B. Impact of Number of S-D Pairs and Channel Coherent Time 

Fig. 4 shows the SE as a function of the number of S-D pairs $K$, where the length of the pilot sequence of each source or destination is set to 1. When $T = 300$, the figure reveals that the SE does not increase without bound as $K$ increases, since the SE is ultimately limited by the channel coherence time. In fact, as the number of S-D pairs approaches $T/2$, the SE converges to zero, since all the time is allocated to the pilot phase. Meanwhile, the SE per S-D pair is a strictly decreasing function of $K$. A similar result can be observed when the channel coherence time is very long ($T \to \infty$). This is because the MUI, EI and distortion noises become the limiting factors when $K$ is large, as seen from the scaling behaviors in Table I and Table II. This implies that the number of S-D pairs should be



Eig. 5: Number of relay antennas required to achieve 3bit/s/Hz 
SE per S-D pair with different hardware qualities, where K = 

10, PsR,k = PRD,k = 1, Pei = 5dB, Ns = Nr = 

[l^J,ro = 0.4, |r£/| = 0.7. 

limited in order to guarantee a target SE for each S-D pair, and this number decreases as the quality of hardware degrades.

C. Comparison with Relevant Schemes 

In this subsection, the SE of HIA-MM-FDR is compared with that of massive MIMO HDR (MM-HDR) and MM-FDR with a ZF-based transceiver (ZF-MM-FDR) [12]. The channel coherence time is set to $T = 300$ and the length of the pilot sequence is $\tau = 2$. The power of each pilot symbol is $E_T = 10$ dB. For HIA-MM-FDR, the maximum power constraints at sources and the relay are set to $E_{S,k}^{\max} = 5$ dB and $E_R^{\max} = KE_{S,k}^{\max}$. Without JDPO, we set $E_{S,k} = E_{S,k}^{\max}$,
En^k = E'n^^/KAB. Moreover, Ar and At are set to 
Ar = max{iT, } and At = max{Ar, [| A'aJ }. When 

JDPO is applied, Es^k, ER^k, Ar and At are determined 
by Algorithm The maximum repeat time L of JDPO is 
set to 3 and the elements of subset Ar are picked uniformly 
from Ar with step max{10, At is obtained with the 

similar approach. The above parameters are chosen so that the performance loss due to the suboptimal search for $A_R$ and $A_T$ is negligible.

Fig. 5 shows the number of relay antennas required to achieve an SE of 3 bit/s/Hz per S-D pair (which ideally can support 64-QAM transmission with a rate-1/2 channel code) under different levels of hardware quality, where we set $\upsilon_S = \upsilon_D = \upsilon_R = \upsilon$ and $\mu_D = \mu_R = \mu$. The figure reveals a tradeoff between the number of antennas and the hardware quality. That is (as discussed in Section V-D), by increasing $N_R$ and $N_T$, we can relax the hardware quality, i.e., let $\upsilon$ and $\mu$ grow linearly, without degrading the SE. Meanwhile, it is seen that the proposed HIA-MM-FDR reduces the required number of relay antennas significantly compared with ZF-MM-FDR. Moreover, HIA-MM-FDR outperforms MM-HDR for large $\upsilon$ and $\mu$. This is because the distortion noises, rather than the EI, become the main limiting factor in this case.

The effect of asymmetric numbers of transmit and receive 


Fig. 6; SE comparison with asymmetric numbers of transmit 
and receive relay antennas, where K — 10, Ns = 

Nd = PsR,k = PRD,k = 1, Pei = 5dB, tq = 0.4, 

\rEi\ = 0.7, Us = ud = i'R = 0.05, = tJ-R = 0.05. 


Fig. 8; Comparison on spectral-energy efficiency tradeoff, 
where K = 10, Pei = lOdB, Ng = [^J, Nd = [^J, 
ro = 0.2, \rEi\ = 0.8, vs = vd = vs. = 0.05, fJ-D = fJ-R = 
0.05. 





Fig. 7: SE v.s. variance of echo interference channel Pei, 
where K = 10, Ns = [^\,Nd = [^\,PsR,k = PRD,k = 
1, ro = 0.4, \rEi\ = 0.7, us = vd = vr = 0.05, ^d = fiR = 
0.05. 

antennas at the relay is shown in Fig. 6, where we set $N_R + N_T = 200$. It is shown that allocating more antennas to the receive side of the relay is beneficial for HIA-MM-FDR (without JDPO) and ZF-MM-FDR. The reason is that, in the considered setup, the limiting factors of the system performance are the EI and the distortion noise due to the receive imperfection at the relay. From Theorem 4, these factors can be suppressed by the receive antenna array of the relay. However, when JDPO is applied, it is optimal to set $N_R = N_T$. This is because the JDPO algorithm in fact tries to balance the achievable rates of the two hops by power control and by adjusting $A_R$ and $A_T$. This makes it unnecessary to allocate more antennas to the receive side.

Fig. 7 shows the SE of HIA-MM-FDR as a function of the variance of the EI channel, $\beta_{EI}$. As expected, there exists a switching point between HIA-MM-FDR and MM-HDR as $\beta_{EI}$ increases. By increasing $N_R$ and $N_T$, the constraint on $\beta_{EI}$ for HIA-MM-FDR to achieve a performance gain relaxes, which indicates that HIA-MM-FDR becomes more attractive when the number of relay antennas is large. Moreover, Fig. 7 demonstrates that the proposed JDPO algorithm can relax the constraint on $\beta_{EI}$ significantly. In particular, for $N = 200$, an SE gain of 7.5 bit/s/Hz over MM-HDR can be achieved by HIA-MM-FDR when $\beta_{EI}$ is 20 dB.

Fig. 8 considers the SE-EE tradeoff of different schemes.
The large-scale fading coefficients of channels are set to

$\{\beta_{SR,1}, \cdots, \beta_{SR,K}\} = \{0.818, 0.052, 1.01, 0.026, 0.016, 0.803, 0.051, 0.383, 2.85, 0.448\}$

$\{\beta_{RD,1}, \cdots, \beta_{RD,K}\} = \{1.187, 0.011, 0.724, 2.11, 0.580, 0.012, 0.147, 0.085, 0.434, 0.458\}$

which is a realization generated with the model in ifT^ . The 
EEs of HIA-MM-FDR and ZF-MM-FDR are optimized by 
solving the problem in Remark [^ with JDPO algorithm and 
[12, Algorithm 1], respectively. It is observed that HIA-MM-FDR
achieves a better SE-EE tradeoff than ZF-MM-FDR. The gain is
mainly due to the optimization of $A_R$ and $A_T$. The reason is
that, if we fix $A_R = N_R$ and $A_T = N_T$, the JDPO algorithm for
EE optimization is similar to [12, Algorithm 1]. The only
difference is that the power for each stream at the relay, i.e.,
$E_{R,k}$, is optimized in the JDPO algorithm, whereas
[12, Algorithm 1] optimizes only the total power of the relay.
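
For readers who want to reuse the large-scale fading realization listed above, a minimal sketch follows. The values are copied from the text; the variable names, the conversion to dB, and the "weakest pair" summary are illustrative additions and not part of the paper's evaluation.

```python
import math

# Large-scale fading coefficients as listed in the text (K = 10 pairs).
beta_sr = [0.818, 0.052, 1.01, 0.026, 0.016, 0.803, 0.051, 0.383, 2.85, 0.448]
beta_rd = [1.187, 0.011, 0.724, 2.11, 0.580, 0.012, 0.147, 0.085, 0.434, 0.458]

def to_db(x):
    """Convert a linear power gain to decibels."""
    return 10.0 * math.log10(x)

beta_sr_db = [to_db(b) for b in beta_sr]
beta_rd_db = [to_db(b) for b in beta_rd]

# Dynamic range of the source-relay coefficients, in dB.
spread_sr_db = max(beta_sr_db) - min(beta_sr_db)

# Per-pair bottleneck: the weaker of the two hops (0-based pair index).
bottleneck = [min(a, b) for a, b in zip(beta_sr, beta_rd)]
weakest_pair = bottleneck.index(min(bottleneck))

print(f"source-relay spread: {spread_sr_db:.1f} dB, weakest pair index: {weakest_pair}")
```

The wide spread between pairs is one reason per-stream power control can matter more than a single total-power setting in this kind of setup.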

VIII. Conclusions

This paper considers the transceiver design of MM-FDR
with hardware impairments. A low complexity HIA scheme
is proposed to mitigate the distortion noises by exploiting
the statistical knowledge of channels and antenna arrays at
sources and destinations. A joint degree of freedom and power
optimization algorithm is presented to further optimize the
SE of HIA-MM-FDR. The analytic results demonstrate that
the proposed scheme can mitigate the “ceiling effect” that appears
in the traditional MM-FDR protocol, if the numbers of antennas
at sources and destinations can scale with that at the relay.
Moreover, simulation results show that HIA-MM-FDR
outperforms MM-FDR with the traditional transceiver scheme.


Appendix 

A. Useful Lemmas Related to the Channel Estimates 

Lemma 1: Let and be the LMMSE esti¬ 
mates of and j., respectively. Dehne 6^^ = 

. In 



’ 

2 

’ r 


2' 

E 



and 5^^ = E 

Nf^X,k^RD,k 



the large-$(N_R, N_T)$ regime, we have

, 2 


E 


• 

2 



2' 

ifsit.feifsit./c 


= E 




1 


tEt 

1 

T Ex 
1 

T Ex 


E 


E 


E 


\SR,k&SR,k^ SR,kPR^SR,k^ SR,k^ 


^SR,k^SR,k^SR,kPR Rsfl.fc 


hHi 


hsR,k&SR,k^SR,k^R '^SR,k 




4 (iv)] shows that E 
2 


h:SR,kQsR,k^ SR,kn:SR,k 

■ Replacing with P^UsR,kPs,k and 

using the following large approximation (obtained based 
on [17, Lemma 4 (ii)]) 


'H.SR,kPRQsR,k^SR,kPR 'ilsR,k 

-'Vfl 

= -^l3sR,kCyi,X^j,^,&Jl,pRCsn,k 


Nr 

SI 

1 


X rsR,kP^cl%,XsR,kCyi, 


= ^/3si?,fcTr ^Cgjij,rsR,kpRCsR,kpR) CsR,k 


It can be shown that 
E _ 

^ tdiag (p^fcPs.fc) C 


hsR,kQsR,k^SR,kpRiisR,kT^SR,k^* 


= vs,kTEx 


PlkC 


SR,k'- 


SR,kPS,k 


(47) 


(Tr 


-2 


(a.(< 


'SR,k 


gSR ^ ^ P^fediag {ps,kPs,k) Ps,k 

+ ? (44) 

X Tr (c|j^ 


The expression of 5^^ can be obtained by replacing f., 
&SR,k’ SR,k, ^'s.fc and ftg^k in (^i with 
TfiD.fc, i'D,k and px^k, respectively. 

Proof: We show the proof for $\delta_k^{SR}$; the derivation
for $\delta_k^{RD}$ is similar. According to the uncorrelatedness between
and hg^ j,, we have 5^^ = E[|Ahgg fchggj,| ] = = 
E[|hf^,,hsit,fel'] - (Tr(Cgj,,fc))^ With (g and (|^i, 
E[|hf^,fehg^ ] can be written as 


Using a similar approach on the remaining terms in (45),
Lemma 1 is obtained. ■
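
The proof above relies on large-dimension approximations taken from [17, Lemma 4]. As a rough numerical illustration, and assuming the cited result is of the standard trace-lemma type (for a vector $\mathbf{x}$ with i.i.d. $\mathcal{CN}(0,1)$ entries that is independent of $\mathbf{A}$, $\frac{1}{N}\mathbf{x}^H\mathbf{A}\mathbf{x}$ approaches $\frac{1}{N}\mathrm{Tr}(\mathbf{A})$ as $N$ grows), the sketch below compares the two quantities. The exponential correlation matrix and all function names are illustrative choices, not taken from the paper.

```python
import random

def exp_corr(n, r):
    """Real exponential correlation matrix with entries r**|i-j| (illustrative)."""
    return [[r ** abs(i - j) for j in range(n)] for i in range(n)]

def cn_vector(n, rng):
    """Vector of n i.i.d. CN(0, 1) samples (variance 1/2 per real dimension)."""
    s = 0.5 ** 0.5
    return [complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for _ in range(n)]

def normalized_quadratic_form(a, x):
    """Return the real part of (1/N) * x^H A x for a real symmetric A."""
    n = len(x)
    total = 0j
    for i in range(n):
        row = a[i]
        ax_i = sum(row[j] * x[j] for j in range(n))
        total += x[i].conjugate() * ax_i
    return (total / n).real

rng = random.Random(0)   # fixed seed so the run is reproducible
n = 400
a = exp_corr(n, 0.4)

quad = normalized_quadratic_form(a, cn_vector(n, rng))
trace_term = sum(a[i][i] for i in range(n)) / n   # unit diagonal, so this is 1
gap = abs(quad - trace_term)

print(f"(1/N) x^H A x = {quad:.4f}, (1/N) Tr(A) = {trace_term:.4f}, gap = {gap:.4f}")
```

Increasing `n` should shrink the gap on average, which is the behavior the deterministic-equivalent steps in the appendix depend on.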

Lemma 2: Let C^ f. and (i G {577, RD}) be the 

covariance matrices of effective channel ^ and its esti¬ 
mates hj f. given by Theorem I If Tr(Cg;j,,) = 0{Nr) 
and Tr(e^^ fe) = 0{Nx), we have Tr(Cg^^fe) = 0{Nr) 
and Tr(Cpri u) = 0{Nx)- Moreover, lim ^ > 0 and 

lim ^ > 0 (i.e., Ag and Ax scale with 0(Nx) and 

Nt—A oo 


0{Nx), respectively). 

Proof: We present the proof for Tr^gg. and Ar. 
Using dehnition of Cgg ^ in Theorem 3 and eigenvalue 
decomposition Cgg. ^ = UEgg we have 

Tr {p.SR,l^ = Tr {Cgx,k^SR,kQsR,k) 

{p.SR,k{^l^SR,k + ^SR,k 

E(Ai {CsR,k))\^l^l {CsR,k)+^2) 


= Tr 

Ar 


1=1 


(48) 

where $\omega_1$ and $\omega_2$ are positive constants independent of $A_R$.
From (48), $\mathrm{Tr}(\hat{\mathbf{C}}_{SR,k})$ is bounded as


(Az {QsR,k)) 

^ wiAi (Cg^ ^) -f 012 


< Tr 


(Cgg < ^Tr {Cgx f.) 


(49) 

In (49), the upper bound is an $O(N_R)$ term. With the definition
in the theorem, we have


of QsR,k 


(45) 

In the large-$(N_R, N_T)$ regime, the result in [17, Lemma

^ 2" 

Tgo the 


Ai (Cgg. f.) — /3sfl,fcAi (^CsR^k'^Ui {Cgx.k) 

P^CsR^kpRUi (CsR^k) (50) 

PsR.k^l (CsR^k^ Ai (CsR^k) = O (1) 


X 

< 


The last step is based on assumption AI. Since Tr [CgR fc) = 
0{Nx), we can conclude that Ai [CgR fc) = ^(1) (otherwise 
Tr (Cgx.k) A ArXi {Cgx,k) < 0 {Nr)). Thus, the lower 
bound in (491 also scales with 0{Nr). Moreover, based on 
the dehnition of Cgg. in the Theorem we have 

Tr (Cgg. ^,) = PsR.k^l [CgR^k] pR.l^SR.kpRJ 


(51) 


< 


4lit/3si?,fcAi (^CsR.k'j Ai (CgR^k) 


(46) 


where pR^i is the hh column of P^. From (51 1 , Ar must 
scale 0{Nr). Otherwise, Tr (Cgg. ^,) < 0{Nr) based on 
assumption AI. 
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B. Proof of Theorem 

To facilitate analysis, we first derive the covariance matrices 
of distortion noises t5[u] and Based on the model 

([^, we have 0^ = Ai?ig{vs,iEs^i, ■ ■ ■ ,’^S,kEs,k)- More¬ 
over, dehne xg [u] = [«],••• ,^s,k I 

expressed based on model Q 


0^ can be 


as 


®R = Mitdiag (E [Hs^risHf^]) 


/r_Rdiag (E ) + (J-rInr 


where 

^ 


\H 


(xs [m] + ts M) (s [w] + ts [m]) 

= diag ((1 + i^s,i) Es,i, ■ ■ ■ , (1 + vs,k) Es,k) 


flu = E 


= E 


{xr [u] +ti^ [u]) {XR [m] 
XR [u] xg [u]] + 0g 


■ti? [u]) 


H 


E = '^usjEs,iE 


/=i 

K 

E 

/=i 


(1 + ^s,i)Es,il3sR,iCsR,i 


of E I I can be obtained as in (|55|l, where 


Then we derive the asymptotic bound in Theorem □ By 


r0THf, in Q, 


with diag 

H 




SR'^S 

0lHg,) + 0R 


®cHf^ - 


R T i-Nn- 

bound on 


replacing Uei'^r±^ei 
we approximate as 

’^S,kEs,k'^SR,k'i^SR,k + diag (FLeI^r^^eI 
The approximation results in a new upper 

since it reduces the power of residual El after 
combining by ^ (The new bound will also be 
referred to as for convenience). Based on [17, 

Lemma 4 (ii)], we can replace diag (H£;7 0gHgj) with 
its deterministic equivalence in the large-TYr regime 


1 

N'T 


diag (H7,7©gHf,) = 


the expression of R^gg*^^ in Theorem 111 can be obtained by 
hrst applying [17, Lemma 4 (ii)] on fc, and then 

using [17, Theorem 1]. 

C. Proof of Theorem 

Since the scenario of interest is the large-$(N_R, N_T)$ regime,
we need only consider the effect of 'WT,k on Rggg (given 
by Theorem [^, more precisely, the term Tr(C£;/0g). With 
the assumption we have diag(CB7) = Thus, 

using the model (Q, we have Tr(C£)70g) = Tr(0g) = 


1 

Nt 


in 

/3B/Tr(C7570g) 


Wfl- 


(52) 


(53) 


Lor further analysis, we approximate E [xr [u] xg [it]] in 
(53 I with diag(E [xtj [m] xg [u]]) = ^0g. Note that the 
approximation results in a new upper bound on Rggg (The 
new bound will also be referred to as Rggg for convenience). 
Straight-forward computations yield to 

K 


^'7^Tr (diag (E [x77[u]xg[M]])) = vr.Ya=i^RE which is 
independent of 

Then we derive the upper bound in Theorem 
Using the model 0, the power of received distor¬ 
tion at the relay can be derived as E ||r7)fe[?7]|| = 

PD,k (j^rPRD^kER^k + RRNTPRD,k ^R,i + l)' More¬ 
over, in the large Nr regime, the term hg^ h7773,fe in ^ 
approaches to hg^, j.0gh777)_fe = l3RD,kTi- {CRD,k®R) = 
VRl5RD,k Er,i [12, Lemma 4 (ii)]. Therefore, it is suffi¬ 
cient to design Wr i^. to maximize the numerator of (0, which 
is exactly the eigen BL scheme. The resultant upper bound 
on achievable rate can be obtained by applying [17, Lemma 
4 (ii)] and assumption A2 on the numerator of 0. 

D. Proof of Theorem 

With the expression of LMMSE estimator li23l . we have 

hsit.fc = HhsR,k^SR,k\m^SR,k2^R,k\)~^^SR,k- Using the 
independence between ^ and distortion noises, we have 






^SR,k^SR,k_ 


= E 


(54) 


^R^SR,kt>S,kt>S,k^SR,k^R 
= PsR,k^l (CsR^k'j PgCs77_fcP77 (56) 
— C 

— '^SR,k 

where the second step follows from a derivation similar to
that in (55). Moreover, E


^SR,k^SR,k 


can be expressed as 


Lollowing the spatial correlation model of H7;7 in sec¬ 
tion III and the approximation under ([5^, the expression 


E [zs77_fczg77g.] — Cg77 j. 

1 


the third step follows from the formula $\mathrm{vec}(\mathbf{A}\mathbf{B}\mathbf{c}) = (\mathbf{c}^T \otimes \mathbf{A})\,\mathrm{vec}(\mathbf{B})$
and the fourth step is due to the fact that the
elements of $\mathbf{X}_{EI}$ are i.i.d. with distribution $\mathcal{CN}(0, 1)$. Inserting
(54), (55) into (52), the expression of $\boldsymbol{\Theta}_R$ in (10) is obtained.
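
The vectorization identity used in the third step can be checked numerically. The sketch below verifies $\mathrm{vec}(\mathbf{A}\mathbf{B}\mathbf{c}) = (\mathbf{c}^T \otimes \mathbf{A})\,\mathrm{vec}(\mathbf{B})$ on small integer matrices, with $\mathrm{vec}(\cdot)$ stacking columns; the matrices and helper functions are arbitrary illustrative choices.

```python
def matmul(a, b):
    """Plain matrix product of two list-of-lists matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

def kron(a, b):
    """Kronecker product: block (i, j) of the result is a[i][j] * b."""
    return [[a[i][j] * b[k][l]
             for j in range(len(a[0])) for l in range(len(b[0]))]
            for i in range(len(a)) for k in range(len(b))]

def vec(a):
    """Stack the columns of a into a single column vector."""
    return [[a[i][j]] for j in range(len(a[0])) for i in range(len(a))]

A = [[1, 2], [3, 4], [5, 6]]      # 3 x 2
B = [[1, 0, 2], [-1, 3, 1]]       # 2 x 3
c = [[2], [-1], [4]]              # 3 x 1 column vector

lhs = vec(matmul(matmul(A, B), c))             # vec(A B c)
rhs = matmul(kron(transpose(c), A), vec(B))    # (c^T kron A) vec(B)

print(lhs == rhs, lhs)
```

Since $\mathbf{A}\mathbf{B}\mathbf{c}$ is already a column vector, the left-hand side is simply that vector; the identity moves the random matrix $\mathbf{B}$ into a single vectorized factor, which is what makes the subsequent expectation tractable.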


tEj 

1 

tEi 


tEr 


E 


E 


E 


F^nsR,kTsR,krf^T^R,kiiUk'PR 


iH 


PgRs7t.fc<^*</>^R^ 


Sfl.fcPfl 


PgN5i7.fcr</>"’Nf77,,P77 


(57) 

According to the independence between HsR^k and Tstj ^, 


the second term of right-hand side of (571 can be rewritten as 


E 


P^UsR,kTsR,kr<I^^T^R,kiisR,kPR 


= i^s,kTERE 


PgHsTt.fcdiag (ps.fepgfe) Hf77 fcP 


= ’^s,kTE^/3sR,k"FT^ (Csfl.fcdiag (ps,fcpgfc)j pRCsR^kPR 


Finally, using a similar approach on the remaining terms of (57)
and substituting the resulting expressions of (57) and (56) into
the expression of the LMMSE estimator, the result in the theorem
is obtained. Using (26)-(27), the second-order statistics of
$\hat{\mathbf{h}}_{SR,k}$ and $\Delta\mathbf{h}_{SR,k}$ in the theorem can be easily verified.

E. Proof of Theorem 

We show the proof for E[Elfe] and E[Dg ^.j due to the space 
limitation. The proof for other terms in Theorem His similar. 


Note that 






- h^h - x^C^'c'^'x- ■ 


(i S {SR, RD}). Based on Lemma the dimensions of 
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E [HEinaU^j] 


1 + —] Pei'E 
vrJ 

\ 

— 

VrJ ^ 


= 1 + 


^1/2y ^1/2 

-^EI'^EI ^r'^ei -^ei'^ei 
1/2 




1 


/=1 

TVt 


C^4^X^,C^//(0T) e;er(©L) C^7 X^/Cb7 


aT \ ^ i^l/2-y^^ /^l/2 


Z=1 

N'j- 


(c]if(^&iy^\i 


^/'vec (X^5/) 


1 H-) Pei ^ 




■,1/2 

■"B7 


(55) 


1 + /3B7Tr (Cb/©t ) Cei 


$\hat{\mathbf{h}}_{SR,j}$ and $\hat{\mathbf{h}}_{RD,j}$ (i.e., $A_R$ and $A_T$) approach infinity as
$\min\{N_R, N_T\} \to \infty$. Thus, according to [17, Lemma 4 (ii)
and (iii)], in the large-$(N_R, N_T)$ regime, we can deduce


1 ---77 --- 


Nr 


Nt 


^srMsr 


Nr 

^^RD±±RD 


^■*“8 ( 


^SR,1 J ; 


,Tr 


1 

Nt 


diag (Xr (c_ed,i j , 






-1 


fined as = E[(xfl [u] + [u]) [u] + [u]) ] = 

E [WrAi^Wlf] + J/i^diag (E [WrA^Wf]). Inserting (jTTi 
into the expression of CIr and using ([58]l, we can obtain 


K 


^R = ^ Er^ 


PtCrd,I^T + ^ 


■fldiag (PtQed^i'Pt 


Z=1 


Tr 




In the large-$(N_R, N_T)$ regime, using (58), we have


E 


E[EL1 = 


hsR,k^RE^EI^Ri^EI^RhsR,k 


{rr^csRyy 


E 


Tr C 


,^1/2 - Al/2 


^SR,k^RE^EI^R"iiEI^RG-SR,k 


{Tr^Qs^yy 


effect on the scaling behavior, we have 


E[EhJ = 


/37J7(Tr(( 


-SR,k 


, K 

'^£;K/(Tr(c«^,)) 


1^1 


xTr (pED^i'P^CEiPyTr (C 


^SR,kPR^ElPR 


(58) 

1) Derivation of $\mathbb{E}[\mathrm{EI}_k]$:

(a) Asymptotic Expression: Based on the model ffl, 
we have E [ti^ [w] tf [u]] = B^diag (E [WtA^WI?^. 

Since Hei and are independent, El^ can be 

rewritten as E[Elfe] = Elw^j^HEi^RpL^jS/VR^k] = 


\elimsRUsR) rnsR^t^Ei^Tn where is de- 


J£^ Er^iXnr-Ar + I {Cei)^Nt-At + 1 ( Cb7 ) 

- VJ^\ -^ 

I=i Tr [QsR,kJ 


where the second step follows from the definition
of $\mathbf{P}_R$ and $\mathbf{P}_T$. Based on Lemma 2,
(Tr(Cgjj = 0{N)^^). Thus we can conclude that 

E [Elfe] < 0{Xnr-Ar+i (Cei) Xnt-At+i{^ei)KN)^^). 

2) Derivation o/E[D]j 

(a) Asymptotic Expression: Based on the model (fn, the co- 
variance matrix of tsj can be derived as E [tg^ [u] tg ^ [u]] = 
Es.jEs idia.g (ps^^-p;^^-). Thus, in the large-(iVi?, Nt) regime. 


using (58 I E[D|j can be expressed 


®[f^77,fe] — (Tr (©5^ ^,^^ ES,jEs,k 


X E 


£s77,fcP77Hs77,fcdiag (ps,feP;^fe) E^SR,kPRhsR,k 


(59) 


-2 ^ 

-f (Xr (©5^. ^ esjEs^ 


X E 


,77 


/=! 

£s77,fcPBHs77,/diag (psjp^^) ^ 


9kj 


(60) 

According to the independence between ^ and ^ 
when k ^ j, gkj can be derived as 


The second step is based on the property Tr(AB) = Xr(BA) 
and independence between ^ and Hei- Finally, using a 
similar derivation with (55 i, E [El^] in Theoremj^is obtained. 

(b) Scaling Behavior: Substituting (59) into the expression
of $\mathbb{E}[\mathrm{EI}_k]$ in the theorem and neglecting the terms that have no

9kj — 


/3sflj Tr (( 


£577,^^77^577 ,/Pt? 


(61) 


Moreover, substituting (26) and (27) in the theorem into (60)
and using the large-$N_R$ approximation in (47), it is
straightforward but tedious to show that $g_{kk}$ can be derived as
gkk = (xr pffcdiag (ps.fcpffc) Ps.fc 

,Tr ^(diag (ps.fcP;^?,) CsR,k 

g-R {^SR,k^S,k^'j Tr 


Tr 


(p-SR,k^SR,k 


(CsR,h 


tEt 


{pSR,j?j 


(62) 

j,y. Inserting 

(611 and (621 into (60i, the expression of E[D)j f,] is obtained. 


where = Ps.fcP^fc + J^s.fediag (ps.fcP;^, 


Jh) Scalmg Behavior: Using the similar approach as that 
for E[Elfc], it can be shown that, as Nr is fixed, the first and 
second terms on the right-hand side of (|60| ) scale as 0(1) 
and 0{KN^^), respectively. Thus, E[D^ j,] = 0(1) since 
K < Nr. As Nr scales with Nr, it is shown in ifT^ that the 
eigenvectors of Cri form a unitary DFT matrix in the large 
$N_S$ regime. In this case, we have $\mathrm{diag}(\mathbf{p}_{S,k}\mathbf{p}_{S,k}^H) = \frac{1}{N_S}\mathbf{I}_{N_S}$.
Applying this property and Lemma 2 to (62), it can be verified
that the first term on the right-hand side of (60) scales as
$O(1/N_R)$ as $N_S$ scales with $N_R$. This completes the proof.
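
The property used in the last step, that a precoder taken from a unitary DFT matrix spreads its power evenly over the antennas, can be checked directly. The sketch below assumes the precoder is one column of the $N_S \times N_S$ unitary DFT matrix; the array size and column index are hypothetical values chosen for illustration.

```python
import cmath

def unitary_dft_column(n, m):
    """m-th column of the n x n unitary DFT matrix (entries of modulus 1/sqrt(n))."""
    return [cmath.exp(-2j * cmath.pi * i * m / n) / n ** 0.5 for i in range(n)]

n_s = 8                              # hypothetical number of source antennas
p = unitary_dft_column(n_s, 3)       # any column index gives the same diagonal

# Diagonal entries of p p^H are the squared magnitudes of the entries of p.
diag_ppH = [abs(x) ** 2 for x in p]
norm_sq = sum(diag_ppH)

print(diag_ppH, norm_sq)
```

Every diagonal entry equals $1/N_S$ up to rounding, so the per-antenna transmit distortion shrinks as $N_S$ grows, which is consistent with the scaling claimed in the proof.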
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